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PREFACE 


Most undergraduates in physics know a great deal of electromagnetism, 
special relativity and quantum mechanics by the time they have completed their 
course, but practically nothing about modern theories of gravitation. Yet 
gravity is now one of the most exciting areas of physics, both experimentally 
and theoretically, in addition to its unique relation with at least the large 
scale structure of space and time. After some forty years of relegation to 
the realm of cosmology, relativistic gravitation has entered astrophysics with 
a vengeance following the discovery of neutron stars and, very recently, com¬ 
pact X—ray sources. While the revival of theoretical interest in gravitation 
has been fed by the successful explanation of at least the qualitative aspects 
of stellar evolution and the apparent inevitability of gravitational collapse 
at the end of the life of perfectly normal stars, the technology of the second 
half of this century has made possible experiments of hitherto unprecedented 
accuracy in the field of gravitation, some qualitatively new. Einstein’s 
theory of gravitation, general relativity, has been verified at the one per 
cent level, gravitational waves are being vigorously pursued and it seems 
quite likely that the compact X-ray source Cygnus X-1 contains a star so col¬ 
lapsed that even light cannot escape it, a black hole. 

The interested undergraduate has no access, other than semipopular articles, 
to any of this excitement, lacking (as do most professional physicists) both 
the mathematics of general relativity and the insight and experience necessary 
to clothe the mathematics with physical reality. I have written this book in 
the hope of purveying the flavour of the physical reality but without unfami¬ 
liar mathematics. The book is based on lectures I first gave to second year 
undergraduates at Oxford in 1974. My audience had completed courses in elec¬ 
tromagnetism and in special relativity, and I determined to tackle gravitation 
without employing on the one hand general curvilinear coordinates, or on the 
other the Lagrangian formulation of field theory: I have applied the elemen¬ 
tary tools of special relativity to the problem of generalising Newton’s theory 
of gravitation. The weakness of this approach is that it is not practicable 
to push beyond the weak field approximation to the Einstein field equations 
(although I did find a way of treating the precession of the perihelion of 
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Mercury) and it is not suited to a discussion of cosmology. Its strength is 
that it builds on material central to any physics course and should, I hope, 
not only deepen the understanding of these foundations but also bridge the 
apparent chasm separating gravitation from the rest of physics. It further 
has the advantage that although gravitation is treated as a classical field 
(and a field corresponding to mass-less spin tivo gravitons at that) the dis - 
to rtion of space-time by the p resence of mass em erges as a desirable y?^ of 
interpre ting the results . 

This approach is liable to incur odium on the grounds that it delays the incul¬ 
cation of the only proper mode of thought, namely the geometric treatment of 
general relativity. My defence against such a charge lies mostly above, but 
I should remark that a field theoretical approach to gravitation leads almost 
inevitably to general relativity i^en pursued to all orders in the strength of 
the gravitational potential, and it should always be remembered that physics 
is an experimental subject and it is not impossible that a pure geometric ap¬ 
proach might one day prove to be untenable. Anyone intending to develop a 
professional interest in gravitation must learn the mathematical language of 
general relativity: it is my hope merely to have provided a way in to the 
subject for those not yet so equipped, whether undergraduates or professional 
physicists whose training and modes of thought are subject to the same 
limitations as my own. 

The plan of this book is as follows. In Chapter 1 special relativity is 
briefly reviewed, with the emphasis on the Lorentz covariance of the equations 
of physics. There is then a short discussion of accelerations in the frame¬ 
work of special relativity. Gravity first appears at the end of this chapter, 

when the reader is confronted with two problems. The augmenting of Newtonian 
gravitation by mass-energy equivalence allows a calculation of the gravita¬ 
tional redshift, which comes out right, and a calculation of the deflection of 
light by the Sun, which comes out wrong by a factor of two. A simple minded 
application of the principle of unique acceleration in a gravitational field 
(often loosely referred to as the principle of equivalence) yields precisely 
the same answers. The two problems are, first, how can the gravitational 
deflection of light be a factor of two bigger than yielded by these calcula¬ 
tions, which get the redshift (an energy difference) right, and secondly, 
given that light deflected by twice the result of these naive calculations, 
how is it possible nonetheless to maintain that there is no way of detecting a 
gravitational acceleration by observations within a freely falling laboratory? 
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Chapter 2 contains a discussion of the Edtvhs-Dicke experiments Tvhich 
established the identity of inertial and gravitational mass, and the detailed 
conclusions that may be dravn from them. In Chapter 3 the equations of 
electrodynamics are derived by starting from the equations of electrostatics 
and requiring that the more general equations are Lorentz covariant: electro¬ 
dynamics is used as a model for gravitation, and this machinery is applied to 
gravity in Chapter 4, Mihere the field equations are set up \iith the conserved 
energy—momentum tensor as source. The observed deflection of light by the 
Sun is used to distinguish betiveen various a priori possible forms for the 
gravitational potentials. In Chapter 5 the machinery is applied to the force 
la'ws, yielding equations of motion, and the ansTver to the first problem raised 
in Chapter 1 is provided: there is a velocity dependent force acting at right 
angles to the motion which deflects a particle without changing its energy. 
The slowing down of light in a gravitational field, together with the equations 
of motion in free fall, is used in Chapter 6 to find the answer to the second 
question raised in Chapter 1. Fnom the point of view adopted in this book 
clocks are physically slowed down by a gravitational potential and measuring 
rods are physically contracted: effects which may be represented if desired 
as a distortion of space-time. 

Since gravitational energy is expected to be a source of gravity, the full 
gravitational field equations must be nonlinear and the work of Chapters 4-6 
is conducted only in the weak field approximation. Gravitational redshift, 
deflection of light and radar echo delay may all be discussed in these terms, 
but the fourth famous test of general relativity, the precession of the peri¬ 
helion of Mercury, may not be. This is the subject of Chapter 7, in which 
the weak field equations of motion are augmented by a nonlinear term. This 
term is derived by requiring that the characteristic times and distances of 
gravitationally bound systems are affected by an external gravitational poten¬ 
tial in the same way as atomic periods and sizes, a form of the principle of 
strong equivalence. Since celestial mechanics tends to be neglected in under¬ 
graduate physics courses, the phenomenon of precession and the extraction of 
a numerical value are treated in terms of radial oscillations superimposed on 
a circular orbit. 

In Chapter 8 the weak field equations are again employed in discussing the 
nature of gravitational radiation, its generation and detection. The dis¬ 
covery that the radiation fields associated with the theory giving twice 
the Newtonian deflection of light are gauge invariant gives a theoretical 
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motivation for this choice of theory and makes the connection \gith mass-less 
spin tTvo gravitons. 

The last two chapters are necessarily disjoint from the rest of the hook. In 
Chapter 9 an attempt is made to establish the connection with general relati¬ 
vity, but the Einstein field equations are neither derived nor solved. The 
relation between the metric tensor and gravitational potentials is discussed, 
and the Schwai^zschild solution in both standard and isotropic coordinates 
used to discuss the experimental tests of general relativity. The last 
chapter contains a brief discussion of black holes. The Schwarzschild solu- 
tion is used to treat the propagation of light and particle motion in very 
strong fields and the book itself ends with a short discussion of how black 
holes may manifest themselves to the astronomer. 

In constructing this approach to gravitation, I found the following works 
particularly useful: 'An alternative approach to the theory of gravitation', 
W.E. Thirring, Annals of Physics, 96 (l96l); 'Lectures on Gravitation', 

R.P. Feynman, (1962-63), (unpublished lecture notes); 'Gravitation without a 
principle of equivalence', R.H. Dicke, Rev. Mod. Phys., 363 (1957). In 

preparing the last chapter I benefited from 'Black hole physics', R. Sexl, 
CERN Report TH1759 , (1973). 

I am much indebted to I.J.R. Aitchison, S.J. Orebi Gann, J.C. Miller and F.N.H. 
Robinson for their critical reading of the draft of this book and innumerable 
helpful comments. 


CHAPTER 1 

SPECIAL RELATIVITY AND ACCELERATIONS 


1.1 Special relativity in brief 

The principle of relativity asserts that there is no meaningful way of 
defining absolute velocity. In this form it has little physical content but 
we can express it in more physical terms: the laws of physics are the same 
for all observers in uniform relative motion. Such observers inhabit inertial 
frames of reference: the definition of an inertial frame is that in an inertial 
frame a particle moves in a straight line at constant velocity unless it is 
acted ijpon by a force. This is not a circular statement: if there is a force 
something is present to produce it. 


A coordinate transformation connects observers in different frames. If the 
principle of relativity is correct (and this must be decided by experiment) 
then a given set of equations expressing a piece of physics in one frame, sub¬ 
jected to such a transformation, retains its form and numerical content. In 
Newtonian mechanics, the equation 


which we may write as 


F = mi 


F. = ma. 
1 1 


( 1 . 1 . 1 ) 


retains its form and numerical content under arbitrary translations and rota¬ 
tions of the coordinates. It also retains its form and numerical content 
under the Galilean transformation connecting two inertial frames 


= X - vt 


z = z 
t' = t 


( 1 . 1 . 2 ) 

y = y X = Z 

(augmented if so desired by translations and rotations). This transformation 
relates the coordinates of a given event in the primed inertial frame to the 
coordinates of the same event in the unprimed frame. The velocity v is the 
relative velocity of the two frames, for 

and — , = +v (1.1.3) 


dx^ 

dt' 


The spatial separation of events occurring at the same time is the same in both 
frames (an invariant quantity) and the acceleration of the test particle of 
mass m is the same regardless of which set of coordinates is used. That is 
for a given force any two observers in uniform relative motion measure the 


1 
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same acceleration. A physical laiv -which retains its form and numerical con¬ 
tent under a particular transformation is said to he covariant \vith respect 
to that transformation. 


The equations of electromagnetism are not covariant -with respect to the 
Galilean transformation. Consider for example the electric field of a plane 
-wave 


The phase is 

and the phase velocity is 


E = E sin(kx- at) . 


cp = kx - OJt 


dx 

dt 



(1.1.4) 


The magnetic field is in phase with the electric field: -when the phase is 
zero there is no field and all observers should agree on this. The phase 
should be an invariant and ive can use this to \vork out \^hat happens to k and 
0) under the Galilean transformation, that is to relate the values of -wave¬ 
length and frequency perceived by one observer to the values perceived by 
another moving relative to the first. The phase being an invariant, 

k^x'-oj^t' = kx-a)t 


must hold for all positions and times. 


If x' and t' are given in terms of x and t by the Galilean transformation, 
Tve may equate coefficients of x and t and obtain 

k' = k 


vk' +0)' = 0) 

whence 

0 )' = 03 ( 1 -^) 

and the phase velocity in the primed system is 



We have obtained the Doppler shift and the expected relationship bet-ween the 
velocity of light in the two frames. The fact that the velocity is differ¬ 
ent itself demonstrates that the equations of electromagnetism are not cova¬ 
riant with respect to the Galilean transformations. The point is rubbed in 
by noting that as v-^ c , uu'-^O and in the primed frame the electric field 
oscillates sinusoidally with position and is constant with time. Such a 
field is not a solution of Maxwell's equations. 
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vjelocl^ of light, as measured ivith real apparatus, is a 
universal constant for all unaccelerated observers. We must either suppose 
that velocity does something funny to clocks and measuring rods or that the 
transformation relating the coordinates (x^ i t^) to the coordinates (x , t) is 
not the Galilean transformation. If the velocity of light is a universal con¬ 
stant regardless of the nature of the apparatus employed, these choices become 
operationally indistinguishable. Adopting the second choice, we can obtain 
the correct transformations by abstracting from Maxwell’s equations one prop¬ 
erty: that the velocity of light is a universal constant. 

Consider two frames of reference, each equipped with identical apparatus, but 
moving with velocity v relative to each other along a common x axis. (We 
can always make a rotation of coordinates so as to achieve this.) For con¬ 
venience choose x^ = x=0 when t^ = t=0. 


Let the coordinate x' be related to the unprimed coordinates through 


= a X + a y + a z + a t 

11 12 13 1^ 


and write in general 


X = a X 

li liV V 


(1.1.5) 

where we employ the convention of summation over repeated indices (such a 
repeated index is called dummy) . We expect the coefficients depend 

only on the relative velocity v . This is a linear transformation connecting 
the coordinates of a given event (such as a wavefront reaching a specified 
detector) in one inertial frame with the coordinates of the same event in 
another inertial frame. The transformation is linear because only for a 
linear transformation is the unaccelerated motion of a particle in one frame 
seen as unaccelerated motion in the other (see Chapter 9). 


The equation of an expanding wave front in the unprimed frame is 


2 .2 


X + y + z - c"^ t 


0 . 


(1 1 . 6 ) 


If we choose to define 
in the compact form 


coordinates 


X 




(x , y , z , ict) this can be expressed 


X X = 0 . 

li 


(1.1.7) 


In the primed frame, related to the unprimed frame by 



a X 
IJV V 


(1.1.5) 


we must have 
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and so 


x' x' = 0 

P P 

x' x' = a a xx=xx 

p p pfi pv ^ V a a 


This gives us at once the relation 


a a =6 
Pii pv IJ.V 


( 1 . 1 . 8 ) 


where 6 


is the Kronecker delta function, having the value 1 if |i = v 
and zero otherwise. 

For transformations along the mutual x axes we can write a further relation. 
We have from (1.1.5) 

= « 1 V \ • 

Differentiate with respect to x^ for fixed and get 


*^1 


(1.1.9) 


This equation relates the transformation coefficients to the relative velocity 
of the two frames, and with eq. (1.1,8) determines the transformations. We 
first note that for motion along the mutual x axes only the x and t coor¬ 
dinates can he mixed without inconsistencies arising and so the relation 


X ^2 + ^^2 

(x'2 _ _ c^t^) 

gives, on equating coefficients, 

afi + = 1 

.2 . ^2 


«14 *^44=^ 
®11«14 + «41^44 = ° 


which is just eq. (1.1.8) written out explicitly for this case. 
If we call the velocity parameter V , 


"11 


®14““ ® 



V = ^ 

1C 

1 

1 

1 

44-^ 

/i + v2 

" ,/ i z! 

\ 1 2 

c 


-V 

- v/ic 


/l + 

CM ICM 

>1 O 

II 


then 
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so that 


xi = 
1 


xi - VX4 


’/TT 


(1.1.10) 


/ ^^1 ^ ^4 

""4 ^/ 1 + V2 

/ X - vt 


t' = 


Vl ^ v^/c^ 

t-vx/c^ 

Vl - v^/c^ 


( 1 . 1 . 11 ) 


\^hich are the familiar Lorentz transformations. 

We no-w note that any quantity 

Ax Ax 

has the same numerical value in any inertial frame and is thus an invariant. 
The quantity Ax^ transforms according to 


K 


a Ax 
IJLV V 


and is the prototype four-vector: its length is the prototype invariant. 

Consider a particle at rest in one frame. Over any interval of time At its 
spatial coordinates in that frame do not change and so 

Ax' Ax' = =-c^ At2 . 

In any other frame moving with velocity v along the x axes the particle 
moves a distance Ax in time At and so 


Ax^ - c^ At^ 


: - c^ At^ 


At 


-TT = V and so 


At = 


At 


Vl - v^/< 


( 1 . 1 . 12 ) 


where At is the proper time interval: the time interval elapsed in the cen¬ 
tre of mass of the particle. This is the famous formula for time dilation. 
Time (as measured with real standard clocks) elapses more quickly in the labo¬ 
ratory frame than in the rest frame of a high energy particle passing through 
the laboratory. 

The same result can be obtained at once from eq. (1.1.11) if we note that the 
X coordinate does not change in the rest frame. The recipe is quite unambi- 
the mean life of a particle moving with respect to an observer is 


guous: 












6 


GRAVITATION AND RELATIVITY 


greater than the mean life of an identical particle at rest with respect to 
the observer 


At' = 


At - 


vAx 


At 


1^-4 

'V 


/I - 


if Ax = 0 in which case At is the proper time interval At. 

Suppose that at time t = t'=0 a high energy particle passes through a coun¬ 
ter at x' = x=0. The firing of the counter constitutes an event. Later 

the particle passes through a second counter and decays in it. The simulta¬ 
neous firing of the second counter and the decay of the particle constitute a 

second event. This event also occurs at x'=0 in the particle frame, hut 

at time t' . In the laboratory frame it occurs at x, t . Then 


0 = 


X - vt 


X = vt 


1 - 






In time t' the point x=0 has moved hack a distance vt' in the particle 
rest frame. The distance measured in the particle rest frame between the two 
counters is thus 


:'=vt / l-^ = 


1 - 


This is the Lorentz contraction, which is however not susceptible to measure¬ 
ment in the direct way the time dilation is. 


¥e now define any quantity with four components V^ which are mixed together 
under the Lorentz transformations according to 


V' = 
l-L 


a V 
1-LV V 


(1.1.13) 


to be a four-vector. 


Then 

V' V' = 
1-L 


a a V V 

\lp f> O 


V 


V 

V 


using eq. (1.1.8). The length of any four-vector is an invariant. If we 
have two different four-vectors the analogue of the scalar product of two 
ordinary vectors is 
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A'B'=a a AB=AB 
l-L \Jip \XO P (J V V 


and is also an invariant. 


10 ) \ 


Thus in special relativity ve can see at once that the quantities V—’ 
constitute a four vector, since 


k • X - 0)t 

must he an invariant. This at once gives us the relativistic Doppler effect. 

We have not demonstrated that the equations of electromagnetism are Lorentz 
covariant (^e shall do this in Chapter 3 : for a conventional treatment see 
ref. [l]) hut ^/fe may define fields Tvhich are four-vectors V^(x^) and four- 
scalars S(x^). We are therefore interested in the effects of differential 
operators on these fields. We might expect that the four quantities 

3S 
3x . 


make up a four-vector. We may write 

3s' as 






Sx 


ax' 


(1.1.14) 


To evaluate the quantity dx^/Sx^ we need the inverse of 


X = 

l-L 


\ X 
liV V 


(1.1.5) 


Multiply these four equations hy the appropriate add; this is 

represented hy 

a x' = a a ^ X 
lip li ^p liV V 


and since 


a a =6 
lip liv pv 


a X 
liV li 


( 1 . 1 . 8 ) 


and 


Then 


ax' 

as' 

ax' 


liv 


liv 


as 


(1.1.15) 

(1.1.16) 

(1.1.17) 


and the four quantities aS/ax do indeed make up a four-vector : an equation 
of the form 

is thus Lorentz covariant. 


= V 


(1.1.18) 
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Similarly we might expect BV /b x to be invariant. Write 

|a 


^ = A{a V) = 

*- ijv vJ 

9V 3V 


3v ax 

> —e 

UV axp 


1J.V HP axp ax^ 

and indeed we have an invariant. An equation of form 

bV 

^ = s 

is Lorentz covariant. 


(1.1.19) 


( 1 . 1 . 20 ) 


Similarly it is easy to show that 






S 


is a scalar field and 


is a four-vector field. 


3 




V 

V 


The invariant operator 





( 1 . 1 . 21 ) 


is the generalisation of the Laplacian and is called the D'Alembertian opera¬ 
tor, frequently denoted by Q . 


If we accept that the equations of electromagnetism are true for all observers 
in inertial frames and are Lorentz covariant, then either we must discard the 
principle of relativity, or the equations of particle mechanics must also be 
Lorentz covariant. Newton's laws are covariant with respect to the Galilean 
transformation and must therefore be modified if the principle of relativity 
is to hold. 


Since the proper time t elapsed in the rest frame of a particle is an inva¬ 
riant, the quantities dx /dT (where x are the coordinates of that parti- 
cle in any inertial frame) form a four-vector* If a particle is moving 
slowly, T-*t and the first three components become the velocity. We may 
therefore call this quantity the four-velocity. A second differentiation 
provides us with a further four-vector, the four-acceleration d'^x^/dT . 

Multiply the four-velocity by an invariant quantity with the dimensions of 
mass and we obtain a four-vector which is called the four-momentum 
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p = m - - . 


( 1 . 1 . 22 ) 


For slow motion the first three components may be identified with the momentum 
in Newtonian 
i particles 


in Newtonian mechanics, which is conserved. Since p is a four-vector, for 

M" 


) p^ = ' a p = a P • (1.1.23) 

i 1 i 1 ^ ± 

If the four-momentum is conserved for any observer in an inertial frame, it is 

conserved for all such observers. We may express p^ in terms of the velo¬ 
city of the particle v in any given inertial frame : 


dx dx. 
n —1^ 
o dT 


(1.1.24) 


where x. = ict and 
4 


thus obtaining 


dT = Jl- dt 


P = 


m V 
o ~ 


If we define p^ = i 


. E 


'i ^ 

' ■ c 2 


m 1C 
o 


/ 1 - 


then 


E = 


m c 
o 


/i 

V - ^2 


2 1 2 
m c + ^m V 
o o 


(1.1.25) 


(1.1.26) 


in the low velocity limit. The implication of these equations is that energy 
has an inertial mass and conversely that inertial mass is a manifestation of 
energy, the two being linked by the relation E = mc^ . The analogue of 
Newton’s laws is thus 


F =r m 


d^x 


dT'^ 


(1.1.27) 


where F is a four-force. 
1-L 


The invariant quantity m^ , the proper mass, is given by the square of the 
four-momentum 


dx dx 

p p = m^ = - m^ c^ 

^p o dT dT o 


2 E ^2 ^2 
3 - —rr = - m C 


(1.1.28) 

(1.1.29) 


We may note that since the square of a four-vector is an invariant, only three 
of the components are independent. In particular, if 
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then 


dp 

— = F 
dt - 


it = 


Tvhere 


dx 

~ = dt 


If we choose units such that c = 1 then £ = Ev and 


dp 


dv 


dE 


So 


and so 


dt - ® <3t - at ' 


dv 


F - vv . F 


dv 

dt 


1 ^ 
c2 


m 




(1.1.30) 


(1.1.31) 


A particle exposed to a constant force in the laboratory accelerates less and 
less as the speed builds up. 


1.2 Special relativity and accelerations 

In accepting both the principle of relativity and the Lorentz transfor¬ 
mations which were enshrined within it by Einstein, our notions of the proper¬ 
ties of space and time have been changed. Along with these changes go changes 
in our notions of the properties of velocity, acceleration, force, mass, momen¬ 
tum and energy. Every experiment in particle physics, conducted at energies 
significantly greater than the rest mass energies of the particles involved, 
bears witness to the applicability of the Lorentz transformations to the laws 
of mechanics. Indeed particle physics tells us more. The tentatively 
constructed hypotheses concerning the interactions of the denizens of the 
microscopic world are all written so as to be covariant under the Lorentz 
transformations, and embody quantum mechanics as well as special relativity. 

The full range contains quantum electrodynamics (the most precisely checked 
of all physical theories), the theory of the weak interactions responsible for 
P-decay, innumerable field theories of greater or lesser physical significance, 
the S-matrix description of the strong interactions which hold the nucleus 
together and nowadays the first steps towards theories of the internal struc¬ 
ture of the strongly interacting particles themselves. Nowhere is there 
evidence that the Lorentz transformations are not applicable, even when dealing 
with particles whose energies are hundreds of times their rest mass (protons 
at the Fermi National Accelerator Laboratory at Batavia, Illinois) or tens of 
thousands of rest masses (electrons from the linear accelerator at Stanford, 
California). 
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The Lorentz transformations link the coordinates of a given event as seen from 
two different inertial frames of reference. We already defined an inertial 
frame of reference as one in which a test particle moves with constant veloci¬ 
ty unless acted on by a force: if we wish to be picturesque we may opera¬ 
tionally define an inertial frame of reference as one in which it is possible 
to play three dimensional billiards. 

A frame of reference which is being accelerated by rockets firing is clearly 
not an inertial frame. This has given currency to the erroneous notion that 
special relativity is incapable of discussing the laws of physics experienced 
by accelerated observers. This idea is wholly incorrect: within the postu¬ 
lates of special relativity we have an unambiguous recipe for discussing such 
observers. The crucial point is that space and time intervals as measured 
by different observers depend only on relative velocity and not on accelera¬ 
tion. This is built in to the definitions of four-velocity and four- 
acceleration on which covariant equations of motion involving accelerations 
are constructed. Thus while acceleration may break a given clock, the rate 
at which time elapses on a moving particle differs from the rate at which 
laboratory time elapses by a factor depending only on the velocity and not on 
the acceleration. The applicability of Lorentz covariant equations of motion 
to physics already reveals this. 

The misconception that special relativity is helpless in the face of accelera¬ 
tions arises most poignantly in the so-called twin paradox. Castor is an 
astronaut and visits a suitably distant star, say Sirius, and returns. His 
journey is made at very high constant velocity, apart from brief periods of 
acceleration, with respect to his brother Pollux who remains at Starbase some¬ 
where near Pluto. We may infer that on arriving home Castor finds his chrono¬ 
meter registers some 20 years less elapsed time than the identical one in Space 
Control and that his brother has aged some 20 years more than he. An alterna¬ 
tive scenario allows Castor to accelerate at a reasonable rate, say Ig , for 
half his journey, turn his spacecraft round and decelerate for the remaining 
half, the return trip being accomplished in like manner. The result is essen¬ 
tially the same. 

These results are derived in the following way. Pollux uses the special 
relativistic formula for time dilation, which depends only on the relative 
velocity, to work out how much slower Castor’s proper time is elapsing. He 
is in an inertial frame and knows that if special relativity is correct 
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Castor's clocks are keeping the same time with respect to an instantaneously 
comoving frame as his would keep if subjected to Castor's acceleration, as 
seen from the comoving frame. He can do experiments with such clocks and 
finds that for accelerations tolerable to humans a properly constructed clock 
measures time independently of its acceleration (although of course not inde¬ 
pendently of the velocity). Pollux thus arrives at an unambiguous answer to 
the problem of relative ageing. 

Castor however cannot directly apply the formulae of special relativity be¬ 
cause he has been accelerated over some parts of his journey - and he knows 
it because in addition to clocks he is equipped with accelerometers. These 
accelerations remove the symmetry between the two observers that would other¬ 
wise preclude a differential ageing, but they do not affect the proper rate of 
the accelerated clocks. Castor can apply the formulae of special relativity 
provided that he takes account of the fact that during the periods of accelera¬ 
tion he was continuously changing his own instantaneous inertial frame. 

The twins could have worked out together the recipe for doing this before 
Castor ever left Starbase. The construction of such a rule book [2] is of 
course dependent on space-time transformations of relativity (instantaneously) 
not depending on accelerations : from the point of view of an observer in an 
inertial frame acceleration may be interpreted as the accelerated system chang¬ 
ing inertial frames and it may be tracked by a continuously changing Lorentz 
transformation. This interpretation is iii 5 )licit in the description of the 
physics of systems involving accelerations in terms of Lorentz covariant equa¬ 
tions of motion : a description which is successful both at very high velocity 
and at enormous acceleration. The relativistic definitions of energy and 
momentum (1.1.22) hold at enormous acceleration: the implication is that the 
relativistic definitions of space-time intervals also hold at enormous accel¬ 
eration. 


It is instructive to work out the accelerations involved in quite ordinary 
physical systems which are well understood in terms of special relativity. 


(l) Atomic and nuclear structure 

(a) In a hydrogen atom the rate of change of velocity 


V 

r 


m e 
e 


m r 
e 




On putting in mumbers this acceleration is found to be 


V of an electron is 
( 1 . 2 . 1 ) 

lO^^cms”"^. (The 
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velocity of the electron is ^ 2 X 10® cm s ^ and the orbital period is 
- 10-^® s.) 

The precise tests of quantum electrodynamics carried out for the hydrogen atom 
yield agreement between theory and measurement at a level of around 5 parts in 
10® for the Lamb Shift and 5 parts in 10® for hyperfine splitting [3] . 

(b) Under the heading of electrons in atoms, we may include a further effect: 
the Thomas precession of electron spin. An electron moving through the elec¬ 
trostatic field of an atomic nucleus experiences in its instantaneous rest 
frame a magnetic field which interacts with the magnetic moment of the elec¬ 
tron and causes a precession of the spin. The precession frequency calcu¬ 
lated in the comoving frame is twice that observed in the laboratory. The 
reason is the existence of a term due entirely to special relativity and not 
due to any particular interaction, the Thomas precession. In the semi- 
classical calculation of the electron spin precession, the torque and the 
precession rate are evaluated in the instantaneous rest frame of the electron. 
The instantaneous rest frame is however precessing with respect to the labora¬ 
tory frame in which the nucleus is at rest, by an amount that can be calcula¬ 
ted straightforwardly from special relativity C43, The evaluation of the 
precession rate in the instantaneously comoving inertial frame of the electron, 
plus the transformation back to the laboratory using the Lorentz transforma- 
ions gives the right answer for the net precession observed in the laboratory, 
and moreover, as it must, agrees with the answer obtained from the Dirac equa¬ 
tion [ 5 ] in which the interaction of a spin particle with an electromagnetic 
field is written in an explicitly Lorentz covariant way. The acceleration as 
observed in the laboratory is again 10^® cm s 

(c) Nucleons in a nucleus. 

Nucleons in a nucleus are confined with 10 cm by the strong interactions. 
Their momentum is given by the uncertainty principle 



so their velocity is ^10^°cms The acceleration they experience is thus 

10®® cm s”^. Bound systems provide very large accelerations over large time 
scales. 

( 2 ) Collision phenomena 

(a) In elastic scattering through the strong interactions a particle moving 
with velocity c can experience a change of velocity of magnitude c due 
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to forces with a range of less than 10“^® cm . The acceleration experienced 
is thus ^lO^^cms”^. 


Tests of quantum electrodynamics with colliding beams of electrons of energy 
several GreV show no breakdown at momentum transfers corresponding to distances 
^ 10"^^ cm , and hence accelerations (for 90° scattering) 10^° cm s”"^ [g]. 


(b) In the bremsstrahlung process an electron radiates a photon as the 
result of acceleration in the field of an atomic nucleus. The change in 
velocity in such a process is '^m c^E for an electron energy E , correspond- 

p 

ing to a change of momentum m_c • This takes a time At — where r is 

c c 

the distance of closest approach and 


So that 


Ap 


At 


Ze^ 


At 


Z e^^^ 

c^ At 


Ze^ 


(1.2.3) 


e 

which for Z = 1 is 10“^° secs and for Z = 100 is 10“^^ secs . For 
electrons of energy 1 GeV , Av ^ 10^ cm s”^ and the mean acceleration is (only) 
^ 10^° cm s“^ • 


Collision phenomena provide very high accelerations but only for very short 
periods. 


( 3 ) Particle accelerators 

The accelerations we encounter here are less impressive, but are of interest 
since if our Lorentz covariant laws of physics broke down at high accelera¬ 
tions our accelerators would not work. The fact that they do work however is 
no precise verification of Lorentz covariant physics because accelerators are 
tuned to work and breakdown of Lorentz covariant physics below the V^/o level 
would probably be tuned out. A few examples will suffice : 

(a) Protons accelerated in the proton synchrotrons at CERN or Brookhaven 
reach an energy 30 GeV (^ 30 times their rest mass energy) in ~ls . Since 
the end velocity is c the mean acceleration is ^^3 X 10^° cm s”^, and since 
the particles actually experience the accelerating electric fields for 10“^ 
of the acceleration cycle, the peak accelerations are '^lO^^cms”^. The pro¬ 
tons move in circular orbits constrained by magnetic fields and of radius 
^lO'^cm so at full energy the acceleration normal to the motion is 10^’^cm s"^. 
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(b) Electrons accelerated in the Stanford linear accelerator surf ride the 
electric fields of waves travelling in a waveguide. The waveguide is loaded 
in such a way that the velocity of the travelling electric field always matches 
the velocity of the accelerated electron, as calculated from the equations of 
special relativity (l,1.30). The accelerator works: electrons emerging after 
two miles in the laboratory have an energy of 4 X lO'*' times the rest mass 
energy and a velocity equal to c within one part in 10 . The velocity has 
been explicitly checked at the level of a few parts in 10 ^, Lt]. 

In view of these successes of special relativity in describing physics at 
enormous accelerations we may confidently predict that an accelerated clock 
runs slow with respect to an unaccelerated clock by an amount given by equa¬ 
tion (1.1.12). This effect has been directly measured with the CERN muon 
storage ring. 

The muon is a particle of mass 105.7 MeV/c^ (206.8 electron masses) which 
appears to behave in all respects like a heavy electron. In particular the 
electromagnetic interactions of the muon follow quantum electrodynamics down 
to the smallest distances so far probed experimentally ('^ 10“^^ cm), and at 
this level it is still behaving like the point charge of QED, Because the 
muon is more massive than the electron, it may decay through the weak inter¬ 
actions into an electron and two neutrinos and it has a proper lifetime 
T = 2.2 X 10”® s . The rate of decay of a point-like particle would appear to 
provide an ideal clock: within our knowledge of the weak interactions there 
are no internal workings to be affected by acceleration and a point—like parti¬ 
cle can be assigned (instantaneously) a single comoving inertial frame. (This 
is not the case for an extended system: see Chapter 9.) 

In the CERN g-2 experiment muons were injected into a ring of 5m diameter 
and constrained to approximately circular orbits by a magnetic field of just 
over 17 kg. The object of the experiment was a precision measurement of the 
magnetic moment of the muon but the lifetime of the orbiting muons was a 
highly interesting byproduct [s] . The momentum of the stored muons was 
1.27 GeV/c , their energy =^12 rest masses. If the rate at which time elap¬ 
ses in the accelerated muon rest frame is indeed independent of the accelera¬ 
tion, the lifetime observed in the laboratory should be 



26 |jsec instead of 2.2 jOsec . 
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The muons vere tracked over some 150 psec, more than 10^ revolutions. The 
lifetime measured in the laboratory was found to he 26.37 ±0.05 sec, to be 
compared with the calculated value of 26.69 psec. The discrepancy agrees 
with the estimated effects of muon losses in the course of the orbiting. 

The laboratory acceleration of the muons was 4 X 10^® cm s“^. This work 

thus provided the experimental coup de grace to the interminable twin para¬ 
dox (which is dead but won't lie down). 

Lorentz covariant laws of physics work beautifully not only at velocities 
which are within one part in 10® of that of light, but also at accelerations 
in excess of 10^° cm s Mass, energy, momentum and velocity are all 

(instantaneously) acceleration independent. Electric and magnetic fields act 
on an accelerating particle according to prescription and we have direct 
experimental evidence that the rate at which proper time elapses is accelera¬ 
tion independent. 

Thus if we want to work out the physics of an accelerated laboratory, as seen 
by an observer in that laboratory, all we have to do is to work out the 
physics in a given inertial frame and then transform to the instantaneously 
comoving frame to find what our accelerated observer will instantaneously see. 
This of course is exactly what is done to discuss the physics of accelerated 
laboratories within the framework of Newtonian physics and leads to the 
introduction of centrifugal and coriolis forces. ¥e must however use Lorentz 
covariant physical laws and the Lorentz transformations, at least if we are 
studying any phenomena involving high velocities. We must also recognise 
that in addition to the continuous transition between comoving frames that 
occurs in acceleration, for extended systems different parts may be in differ¬ 
ent comoving frames. It is the fitting together of all these different comov¬ 
ing frames as a function of space and time coordinates that constitutes the 
generalisation of special relativity into general relativity. But general 
relativity also embodies a theory of gravity. We have not mentioned gravity 
at .all yet, because we have not so far attempted to write down Lorentz cova¬ 
riant laws of gravitation. 

1.3 Acceleration and gravity 

Physical theories embedded in special relativity correctly describe 
physical systems even at enormous accelerations, provided one is careful to 
view these systems from an inertial frame of reference, in which accelerometers 
read zero and three dimensional billiards can be played. A physicist in an 
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accelerated laboratory knows he is being accelerated, because his accelero¬ 
meters do not read zero and he cannot play three dimensional billiards. He 
cannot use the Lorentz transformations without supplementing them with rules 
for taking account of his continuously changing inertial frame. 

Suppose however this physicist is accelerating in a gravitational field, under 
free fall. His accelerometers read zero and he can play three dimensional 
billiards. Gravity has a singular property — or at least a property that 
seems singular to a physicist mostly brought up on the physics of electromag¬ 
netism and structures held together by electromagnetic forces. Everything is 
accelerated in a gravitational field at the same rate, and consequently in a 
small laboratory in free fall there is no internal way of detecting the accel¬ 
eration. This is now a matter of common experience, at least seen through 
the dark glass of the television screen. Conversely, a physicist in a labora¬ 
tory with an accelerometer (for example, a mass on a spring) reading 1 g has 
no internal way of telling whether his laboratory is at rest on the surface of 
the earth or being blasted (by silent and vibration free engines) beyond the 
orbit of Jupiter. Gravity thus has a singular link with acceleration and 
inertia — or perhaps we should put it the other way around and say that accel¬ 
eration and inertia are intimately linked with gravitation. 

These statements are based on Newton’s laws, appropriate to low velocity 
phenomena, and the Eotvo's-Dicke experiments which have failed to detect any 
differential gravitational acceleration of different objects, at an accuracy 
one part in 10^^-10^ (see Chapter 2). (The experiences of astronauts, 
though stimulating, do not constitute any very precise test of this statement.) 
Since we are concerned with gravitation and relativity, we should investigate 
whether this principle of equivalence will hold for high velocities too. The 
highest velocity available is c , so we examine the effects of both gravity 
and accelerations, of the kind produced by rockets, on light. 

We will consider first an accelerating rocket, in which the acceleration a is 
not too different from g , just to keep things simple. There are two problems 
we will work out to first order in the acceleration: the frequency shift of 
light due to acceleration and the departure of light from rectilinear propaga¬ 
tion. 

Light is emitted from a source in the nose of the rocket at time t=0, as 
measured in the frame comoving with the nose at this instant. It heads 
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towards the tail of the 
rocket with constant velocity 
c in this same frame. Be¬ 
cause the rocket is accelera¬ 
ting, in time t it picks up 
a velocity with respect to 
this original frame of at , 
and travels a distance 
^ at^ — these Newtonian ex¬ 
pressions are quite adequate 
for the low changes of velo¬ 
city involved. If the distance between the source and receiver is h , the 
time taken to traverse this distance at velocity c is t = — and hence the 

velocity of the receiver with respect to the frame in T^hich the emitter was 

ah 

instantaneously at rest is approximately — at the instant the light is 
received (see Fig. 1.3.1). The frequency of the light in the comoving recei¬ 
ver frame is thus Doppler shifted on reception at the tail of the rocket by an 
amount 

Ay _ ^ 

V ~ c2 • 

Corrections due to the path travelled in the emission frame being slightly 
less than h, Lorentz contraction and time dilation are all second order in 
small quantities. If the acceleration a is g and h is 10m , then 

=,10-16 _ (1.3.2) 

In a quantum picture the same result of course obtains because the transforma¬ 
tions for energy and frequency are the same: both energy and frequency trans¬ 
form as the fourth component of a four-vector. 

Next consider a beam of light emitted at right angles to the acceleration in 
the appropriate instantaneous comoving frame. In this frame it travels in a 
straight line with velocity c , covering a distance Ax = cAt in time At. 

In this time however the rocket has advanced a distance 

Ay = i-a(At)2 = , (1.3.3) 

If the light is to pass through a set of holes in the rocket structure at 
various Ax, the holes must not lie on a straight line but regress towards 
the tail of the rocket along the parabola (see Fig. 1.3.2) 

Lx\^ 


(1.3.1) 




XT 


t=0,v=0 

Fig. 1.3.1 
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Fig.1.3.2 The figure illustrates the 
parabolic trajectory of a photon in 
the accelerating frame of reference, 
corresponding to a linear trajectory 
in a specified inertial frame 

and light is deflected through an angle 


The angle the light has 
been deflected through (as 
seen hy the physicist in 
in the space vehicle) is 
then 

BAy a-Ax . . 

a =- = (1.3.4) 

We can apparently say that 
if the principle of equi¬ 
valence applies to light, 
of velocity c , then in a 
gravitational field g 
electromagnetic waves are 
shifted in frequency in 
falling a distance h hy 


Av _ £h 


(1.3,5) 


Aa=4Ax (1.3.6) 

in travelling a distance Ax at right angles to the gravitational acceleration. 


We may now see how this ties up with an alternative approach in which we direct¬ 
ly consider light in a gravitational field. We need the principle of equiva¬ 
lence in a slightly different form this time. In special relativity the iner¬ 
tial mass of a system (a particle, atom, light pulse or whatever you like) is 

equal to its total energy content divided hy c^. A light pulse of energy E 

E 

thus has inertial mass and if we are right in thinking that it is indeed 

c 

always the inertial mass that governs the gravitational interaction, then the 
gravitational energy of a light pulse in a gravitational potential cp will he 
F 

—^ cp, and it will experience an acceleration - Vcp. 


The momentum picked up sideways in travelling a distance Ax at right angles 
to the field is thus 

Ap=4l'P-¥ (1.3.7) 

c ^ 

E 

and since the momentum is — the angle of deflection is 
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Aa = vcp -^= g (1.3.8) 

T^(hich agrees Tvith the result we obtained by watching a rocket with proper 
acceleration g from an inertial frame. 


We can also get an answer for the gravitational frequency shift. An atom at 
rest has internal energy E and in a gravitational potential cp this becomes 

e(i + . We have conservation of energy and so for a transition between 

c ' 

two levels, separated by AE in the absence of a potential, the photon energy 
is AE • The difference in energy between a photon emitted at a poten¬ 
tial and a photon emitted at is 


AE^ - AE^ = 




(1.3.9) 


which agrees with the Doppler shift arguments for an accelerating rocket. It 
is important to note that the argument we have just constructed depends on 
conservation of energy and the weight of binding energy ^ being equal to 
g that is, binding energy behaves inertially and gravitationally like 

(negative) mass. We shall examine the best evidence for this (the Eotvos- 
Dicke experiments) in Chapter 2. 


1.4 Measurements of the gravitational frequency shift 

The gravitational deflection of light has not been observed in the laboratory, 
but the gravitational frequency shift has been verified to the l^o level using 
the Mb'ssbauer effect [9]. 14.4 KeV photons from ^^Fe (t = 10“’’' sec) were 

employed, going both up and down a path of 22.5m at Harvard. The fractional 
difference in frequency between photons going up (red shift) and photons coming 
down (blue shift) is thus 4.905 X lO”^^, detected by achieving resonance via 
the Doppler effect: the source must move at a velocity of 7 X 10"^ cm s"^ to 
re-establish resonance. The difficulty of the experiment is brought out by 
noting that the fractional line width, for recoilless emission, is '^10”^. 

The measurement was in fact made by investigating the difference in asymnetry 
of the resonance line as a function of velocity for red and blue shifted pho¬ 
tons* The result obtained was 0.9990 ± 0.0076 of the expected effect, the 
error being purely statistical, with an additional possible uncertainty of 
^ 0.01, being the linear sum of all contributing systematic errors. The con¬ 
clusion is that the gravitational frequency shift (1.3.9) is verified at the 
Ifo level [lO] . 
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The gravitational redshift suffered by a photon in climbing up a 22.5m tower 
is only 2.5 X 10"^^. Larger redshifts are available astronomically. A pho¬ 
ton escaping from the sun will have its frequency redshifted by an amount 

GM 

^2 = 2 X 10“®, while a photon escaping from a white dwarf of mass and 

radius 10"^km would be redshifted by 1.5 X 10”^. Observations of the 
solar gravitational redshift are in accord with expectation and have reached 
a precision [ll] . A comparison of measured and expected redshift for 

white dwarfs requires in addition to measurement knowledge of the mass and 
radius of the star: there is agreement but the precision is only ^^159^ [l2]. 
The terrestrial Mossbauer experiment is thus by far the most precise measure¬ 
ment of gravitational frequency shift, the astronomical measurements checking 
more crudely the first order prediction for stronger fields. 


1.5 The gravitational deflection of light 

While the gravitational deflection of light has not been measured in the labo¬ 
ratory, the deflection of light by the Sun was first measured in 1919 and 
constitutes one of the great tests of Einstein's theory of gravity, general 
relativity. We have already worked out the angle light is deflected in going 



Fig. 1.5.1 The transverse momentum acquired by a photon 
in passing the Sun is calculated by integrating the com¬ 
ponent of gravitational force at right angles to the 
approximately linear trajectory 


a small distance Ax through a locally constant gravitational field: let us 
use this result to calculate the deflection of light by the Sun. Because the 
deflection is very small we do not need to calculate the orbit but just calcu¬ 
late the change of momentum by integrating the normal component of the force 
along a straight line (Fig. 1.5.1). 


so 



E 

GM 

© 

r^.n 

dt 

II 

r2 

r 


J. f 

GMg, 

£*n 

J 

r2 

r 


dt = :^ and p = ^ 
c ® 


( 1 . 6 . 1 ) 

( 1 . 6 . 2 ) 
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a(b) 




GM^ 


:© 

J ^3 


dx . 


(1.5.3) 


The integral can be evaluated directly or by using Gauss’ theorem, for it 
represents multiplied by the normal flux out of an infinitely long 

cylinder of radius b . Thus 

2GM^ 


a(b) = ^ (1,5.4) 


and 


a(i^) 


bc^ 

2GM^ 




for light grazing the limb of the Sun, 

In this derivation we have used the relativistic relation between energy and 

E 

momentum for a photon (or light pulse), set the force equal to - —j 7(p, using 

c 

the relativistic relation between energy and inertial mass, and equated the 
force to the rate of change of momentum. Precisely the same answer is ob¬ 
tained by assuming that light is not observed to be deflected as it passes 
across a box freely falling in a gravitational field and constructing the path 
by fitting together such freely falling boxes, for such boxes are observed to 
be accelerating at a rate 


d^y 


GM 


© i. • ii 


dt 


2 - 


(1.5.5) 


as seen from outside the solar system and hence, as seen from outside the 
solar system the light beam has an equation of motion 


The angle of deflection is 


a^y L • 2. 

dx^ “ r 

(1.5.6) 

dy j. r • n 

dx ~ J 

(1.5,7) 


so a(b) = 2GM^/bc2 once more. 

Putting in numbers, = 1.99 X 10^ 
cgs units 


gm 


= 6.96 X 10^° cm , G = 6.67X10' 


.-8 


a(E^) = 4.245 X 10“® rad 
= 0.875" . 


Experimental observations are all consistent with a value 

a(RQ) =1.75" 

the value which almost everyone knows is predicted by Einstein’s theory of 
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gravity through the relation 


4GM 


R o'" 
© 


vhich is just tvice the Newtonian value we have calculated. 


(1.5.8) 


Experimentally a(R 0 ) has been measured in two ways : 

(1) By photographing the star field around the Sun during a total eclipse 
and measuring the displacement of star images relative to those on plates 
taken when the same stars appear in the night sky (at a remove of *^6 months) . 
The results have a spread on 0.(1^) of 1.3” -2,7” and may be taken as 
consistent with a(RQ)Qj^ to within an error '-25^4 [l3]. The problems are: 

(1) observations are limited to a(> 2R^) because of glare from the corona, 
(ii) total eclipses do not usually enshadow observatories with big telescopes: 
the diffraction image size for a 10 cm instrument is 5 X 10"® rad, (iii) it 
is necessary to compare separate plates taken and developed independently, at 
an interval of some months. 

(2) Long base line interferometry has been used to determine the change in 

the apparent position in the sky of the quasi-stellar radio source 3 C 279 
during its annual occultation by the Sun, The relative phase of the signals 
received by two radiotelescopes is monitored and continually compared with the 
relative phase of the signals received from the quasar 3C 273 which is not 
occulted and is 9.5° away from 3 C 279 . The wavelengths used lie in the 
range 3-15 cm and most baselines in the range 1-20 km . The relative phase 
change for a baseline d is ^ so a shift in phase of one radian corres¬ 
ponds to ^ " 2 ^ which for 10 cm radiation and a 20 km baseline is 10 “® 

radians. Most measurements have been restricted to values of the impact para¬ 
meter ^ 3 R^ because of refraction due to free electrons in the solar corona. 
Because the refractive index of the solar corona is frequency dependent and 
the expected deflection of radio waves by gravity is not, simultaneous work at 
two or more different frequencies allows smaller impact parameters to be used, 
while the use of baselines of thousands of kilometres should eventually permit 
the determination of Cx(R^) to very much better than \P/o. Early results were 
consistent with ©.(R^) = 1.75” with errors lO^^ [l3]. 

The most recent analysis of the occultation of 3 C 279 has yielded OL = (0.99 ± 
0.03) , obtained with an interferometer baseline of 845km [14]. 
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Dual frequency interferometry has very recently been applied to obtain the 
most accurate result to date [15]. Instead of the usual quasar 3 C 279 , 
three almost co-linear radio sources i<?ere used, 0119 + ll, 0116 + 08 and 
0111 + 02, The occulted source was 0116 + 08 and the outer two, lying res¬ 
pectively ^4^ and 6° away on opposite sides of the apparent path of the sun, 
were observed to eliminate local transient effects. The baseline was 35km 
and the deflection found to be 

a = (1.015 ± 0.011) . 

The observations of the deflection of electromagnetic radiation in the gravi¬ 
tational field of the Sun are thus beautifully in accord with the predictions 
of Einstein's theory, made 60 years ago, and totally inconsistent with the 
value calculated by augmenting Newtonian gravity with mass-energy equivalence. 
In most of our subsequent work we shall assume that the deflection of light is 
twice the Newtonian value. 


1.6 An apparent paradox 

We have arrived at a situation in which our simple calculations are not only 
at variance with observation but lead to apparently paradoxical conclusions. 

We have calculated both the gravitational redshift and the deflection of light 
by the Sun in two different ways : by using Newtonian gravity augmented by 
mass-energy equivalence and by using the equivalence of effect of a gravita¬ 
tional field and an acceleration. Both methods agree, for each phenomenon, 
but while we get an answer in accord with experiment for the first effect, 
redshift, we are wrong by a factor of two for the gravitational deflection of 
light. The observed value for the deflection of light suggests that while no 
gravitational redshift is observed in a box in free fall, light will be curved 
by an amount depending on the local gravitational field, violating the princi¬ 
ple of equivalence. Everyone has been told however that general relativity 
is founded on the principle of equivalence and yields both 


V 


gh 


and a(R^) = 


4GM, 


R_c" 


statements we have found to be apparently incompatible. 


We can see a possible way out of the paradox presented by our using Newtonian 
gravitation to compute successfully the energy change of a photon in a gravi¬ 
tational field and unsuccessfully to compute the deflection in the solar gravi¬ 
tational field. The component of force we integrated to give the deflection 
of the photon was at right angles to the motion. If gravity provides an 
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additional force al'ways at right angles to the motion which is equal to the 
Newtonian force for v = c , and negligible for v « c , then the result is 
explicable because a force at right angles to the motion does no work and 
hence cannot change the energy. The motion of a charged particle in a magne¬ 

tic field provides an example, and magnetism is a relativistic effect of 
electric fields. We shall therefore search for some sort of gravitational 
analogue of magnetism and leave the apparent violation of the principle of 
equivalence alone, for the time being. We shall embark on this search in 
Chapter 4, after examining the evidence for the equivalence of inertial and 
gravitational mass in Chapter 2 and studying the relation between electro¬ 
magnetism and relativity in Chapter 3. 
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CHAPTER 2 

THE EOTVOS-DICKE EXPERIMENTS 


2.1 Gravitational and inertial mass 

We have so far been assuming that the source of the gravitational inter¬ 
action is the inertial mass of a system: before we proceed further we must 
examine the evidence for this, for it is possible that the gravitational mass 


M and the inertial mass 
g 

Newtonian law of gravity as 


M. 

1 


are not the same. Thus we should write the 


G 




( 2 . 1 . 1 ) 


dnd equaEc sucli a force to M^a wliere a is "the resulting acceleration. Tlie 
equation of motion of a simple penduliim would then he written 

d^e 


M. 

dt 


, = - M f 9 

2 g -t 


( 2 . 1 . 2 ) 


and its angular frequency 


U) = 



If M = YM. 
g 1 

the definition of G . 


with Y a universal constant, then Y is set equal to 
If we define G by the relation 


F = G 


^il ^i2 


(2.1.3) 


then if Y varied with position in space, with time, or from material to 
material, this would appear operationally as a variation of G , assumed in 
Newtonian theory to be a universal constant. If Y is different for differ¬ 
ent materials, then the period of a pendulum will depend on the material of 
the bob, _, 


2.2 The Ebtvbs-Dicke experiments 

Experiments to test the universality of Y therefore search for a 
difference in the acceleration of tw different objects falling in the same 
gravitational field. The greatest precision has been achieved in the null 
experiments first carried out by E6*tvb*s at the beginning of the century and in 
the early 1960's by Dicke and his school. The Dicke group looked for a 
differential acceleration between two different objects falling in the gravita¬ 
tional field of the Sun. The principles are easily understood. Consider an 
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ideal earth with its rotational axis at 90° to the orbital plane. On the 
equator, two masses of different materials but equal weight, are suspended at 
opposite ends of the arm of a torsion balance, pointing North-South. At dawn 
each mass experiences a force 


GM, 


go 


g r^ 

to the East. To keep it in orbit a force M^vis required, directed to 
the East, where v is the orbital velocity of the earth. If the quantity Y 
is different for the two masses on the torsion arm, one will fall towards the 
Sun a little faster than the other, unless restrained by a twist in the sus¬ 
pension. The effective forces acting on each mass, as seen in the rest frame 
of the apparatus, are thus a gravitational force 


M 

P = M G -|2 
g g 

to the East and a centrifugal force ^ ^2 

F. = 

1 r 

acting to the West. At dusk, 12 hours later, the gravitational forces act 
towards the West and the centrifugal forces towards the East. If the ratio 
of and were not the same for both masses, there would result an 

oscillating torque about the suspension with a period of 24 hours (Fig.2.2.1) 



Dawn Dusk 

Fig. 2 . 2.1 An illustration of the principles of the Dicke experi¬ 
ment to test the equivalence of inertial and gravitational mass 

This oscillating torque would produce an angular oscillation about the suspen¬ 
sion with a period of 24 hours. In the most sensitive of Dicke's experiments 
the two masses were of gold and aluminium and such an oscillation was absent 
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at a level 10“® rad) implying equality of the constant of proportionality 
Y for gold and aluminium at a level of one part in 10^^: equality to \<?ithin 
3 parts in 10^^ at the 95% confidence level is quoted. Eotvos searched for 
an imbalance between gravitational forces due to the earth’s gravitational 
field and the centrifugal forces caused by the earth’s rotation. The appara¬ 
tus had to be rotated periodically (whereas Dicke let the rotation of the 
earth do it for him) and deflection of the torsion balance was looked for with 
optical lever, telescope and the naked eye, instead of optical lever, tele¬ 
scope and electronics as in Dicke’s experiments. Nonetheless, Ebtvbs estab¬ 
lished the equality of Y for a wide variety of substances to a few parts in 
10®. Reference [l] contains a detailed account of Dicke’s experiments and 
discussion of earlier work establishing the equality of inertial and gravita¬ 
tional mass, especially that of Eotvos. A measurement similar in principle 

to those of Dicke and his school has been carried out by Braginsky and Panov 

12 

[ 2 ] , who report equality of Y for aluminium and platinum within 0.9 in 10 
( 95 % confidence limit). 


2.3 Implications of the null result of Eotvbs-Dicke experi men;^ 

We MOW examine the consequences which flow from this null result. 
Remember that in calculating the gravitational redshift using gravity (rather 
than the equivalence of gravity and acceleration) we assumed that the energy 
of a system (such as an atom) in a gravitational field depended only on its 
total energy content (that is, its mass) and the potential and that conserva¬ 
tion of energy held locally in the transitions. The Braginsky version of the 
Eotvos—Dicke experiment yields the result 


We know the inertial mass of an atom to be composed of the following terms (at 
least) 

( 1 ) Electron rest mass 

( 2 ) Electron kinetic energy 

( 3 ) Electron potential energy (electrostatic energy) 

( 4 ) Proton and neutron rest masses 

( 5 ) Proton and neutron kinetic energy 

(G) Proton and neutron potential energy due to nuclear forces 
( 7 ) The electrostatic energy of the nucleus. 


Y = 


M. 


1 


Z 



Z Y.M*! 

j 

Z 

• 1 


2 Y-Mj 
j J ^ 

M. 

1 


Now 


( 2 . 3 . 1 ) 







30 


CaElAVITATION AND RELATIVITY 


Ti/here is the gravitational mass of a particular component and is the 

® 1 
inertial mass of a particular component. Then 

M. 

^ 6 y . (2.3.2) 

1 

Define r\ = Yp.^. - Y^^ so that the anomaly due to a single term, 6 y. is 
given by 

6Y. I = 

3 ' 


unless of course there are grossly implausible cancellations. 

The biggest difference between Pt and M is in the neutron/proton ratio. 
The compositions of Pt and Al atoms are 

Platinum: 78 protons, 78 electrons, 117 neutrons 

Aluminium; 13 protons, 13 electrons, 14 neutrons. 

The fractional weight of neutrons is thus 0.65 for Pt and 0.52 for At. . 

The anomaly in Y due to neutrons is thus less than one part in 10^^ and 
similarly for protons. If there is an anomaly associated with the neutron- 
proton mass difference, which is about 0.15%, then it must be less than one 
part in 10 . Similarly, if electrons have a ratio of gravitational to 
inertial mass different from unity, the difference must be less than one 
part in 10®. 

We may now turn our attention to the contributions of binding and kinetic 
energy rather than the rest mass energy of atoms. On putting together elec¬ 
trons, protons and neutrons to make an atomic nucleus and its retinue of 
electrons, these constituents pick up kinetic energy but lose a greater amount 
of potential energy in the formation of a stable system, which has less mass 
than the rest masses of the constituents. The mean binding energy is the 
modulus of the sum of these two terms, divided by the mass number (number of 
nucleons in the nucleus) of the atom. The mean binding energies of Pt and 
Al differ by about 0,3 MeV - approximately 3 X 10“^ of the nucleon rest mass 
energy. The anomaly in Y due to binding energy is thus ^ 3 X 10-® ; the 
gravitational mass-equivalent of binding energy is equal to the inertial mass- 
equivalent of binding energy to within 3 parts in 10®. We can now break this 
down into its component parts. 


6ri 


10-^2 

"ipt 

1 At 


iPt 1 At 

^iPt" 

M 

i At 


M “M 

i Pt 1 At 











THE EOTVOS-DICKE EXPERIMENTS 


31 


Nuclear electrostatic self-energy 

The electrostatic self-energy £ of a nucleus is quite well represented by the 
expression 


f _ 1 Zfef. 

5 R 


E = R 

o 


(2.3.4) 


A being the atomic mass number, Z the atomic number and R^ =“10 cm . The 
fraction of inertial mass contributed is thus (+) 8 X 10 ^ for Pt and (+) 

3 X 10-2 for At , so 6 static ^ ^ X 10“^°. 


Nuclear binding 

The binding energy of a nucleus is given quite accurately by the semi-empirical 
formula [s] 

B(Z,A) = aA - PA^- - e± 6 (2.3.5) 

where £ is the electrostatic term we have already evaluated, and 6 is an 
oscillating correction for nuclei of even A. The fraction of mass represen¬ 
ted by the first term is the same for Pt and At : it is a first approximation 
to the difference between kinetic and potential energy. The second term is a 
correction due to nucleons near the surface not interacting with the same num¬ 
ber of neighbours as those deep within the nucleus. We may take the second 
term to provide a measure of the quantity 6 y due to the strong interaction 
potential. The constant g is about 18 MeV, so the fraction of inertial mass 
contributed by the term in g is ^ 3 X 10 ^ for Pt and 6 X 10 ^ for A^ , 
and for the strong interactions 6 Y ^ 3 X 10”^°. The limit on 6 Y due to 
nucleon kinetic energy is probably about the same: we can make an estimate of 
it from the third term which essentially represents excess kinetic energy 
forced on nucleons by the Pauli principle, in a nucleus where the number of 
protons and neutrons is different. The fraction of inertial mass contributed 
by this term is 10“^ in Pt and in A^ : we thus find a limit on 6 Y 

from nucleon kinetic energy -'lO”^^. (The constant Y in the semi-empirical 
mass formula, not to be confused with the ratio of gravitational and inertial 
mass, is about 23 MeV.) 

Atomic structure 

The binding energy of the atomic electrons may be estimated from the Thomas- 
Fermi model of the atom [4] 

B^(Z) = 15.73 

and is ~ 6 KeV for At, 0.41 MeV for Pt. Electron binding energy thus con- 
tributes a fractional inertial mass 2 X 10”“^ in A-t, 2 X 10“^ in Pt , Then 
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^ ^ ^ 10"’’’ Mith the same sort of limit on the potential and kinetic 
energies of the electrons separately. 

Gravitational self-energy 

The Eotvos-Dieke-Braginsky experiments do not provide a measure of the ratio 
of the gravitational mass-equivalent and inertial mass-equivalent of gravita¬ 
tional self-energy. The gravitational self-energy of a uniform sphere of 
mass M, radius R is - f vhich for Pt is ergs. The fractional 

contribution to the inertial mass is thus 3 X 10“^® . 


Antimatter 

This is more exotic, but of interest because of the frequently encountered 
speculation that matter and antimatter might be mutally repulsive gravitation¬ 
ally. The inertial mass of antimatter (positrons, antiprotons ...) is cer¬ 
tainly positive so such a speculation implies negative gravitational mass. 

atoms contain temporarily virtual electron-positron pairs, which manifest 
themselves through corrections to the electrostatic potential [s] , and if the 
positron were repelled by the gravitational field of ordinary matter , the 
gravitational mass would be anomalous by an amount [6] 


,^Ze2\2 

~ "■e V hr; 

where the dimensionless quantity (Ze^/hc)^ is the order of magnitude of the 
probability of finding a pair in the field of the nucleus. We can see that we 
expect a dependence on Ze^ (rather than Z^e^) by a simple argument similar 
to that often used to calculate the range of nuclear forces due to the iT-meson 
(pion) “ but caveat emptor . 


If we separate a pair of electrons in the Coulomb field of the nucleus an 
energy Ze^ t\\—j is available. If this is equal to 2m^c^ we may expect vir¬ 
tual pairs to be important. The time scale At over which energy is uncerj- 

tain to AE is given by AE At h and Ar < cAfc ^, This is also the 

m^c 

Compton wavelength of the electron — if we cannot localise the electrons better 
than this, we may set r ^ Ar -- and expect pairs to be important if 


Ze^ 

he 


1. 


This sort of effect is known as vacuum polarisation — the effect of the virtual 
electrons is real in electromagnetism because their transient presence alters 
the effective charge distribution near the source of an electric field [ 4 ] and 
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contributes to the Lamb shift and the anomalous magnetic moment of the elec¬ 
tron, We may therefore have confidence that the gravitational and inertial 
masses of positrons are the same at a level 


6Y 

e"^ 


< 10 " 




m .137/ 
e 


2 X 10“® . 


Similar effects are expected to occur for the strong interactions: virtual 
proton-antiproton pairs (and pairs of other strongly interacting particles) 
should have a transient existence. We have no well developed theory from 
which the size of possible effects can be computed, but the depth of the aver¬ 
age strong interaction potential in the nucleus is about 10 

probability of finding nucleon-antinucleon pairs is perhaps '-^10“^. We should 

be safe in inferring from the Braginsky experiment that 6 y- 10 and 

^etc 

still have an enormous margin for error: we can be confident that antimatter 
does not fall upwards. The most precise test of the gravitational equivalence 
of matter and antimatter is found in K°-K° interference phenomena [7], 


Weak interactions 

We have considered potential energy from the strong, electromagnetic and gravi¬ 
tational interactions. In the first two cases the E6*tvos-Dick e-Bragin sky 
experiments allow us to conclude with great precision that the associated iner¬ 
tial and gravitational masses are identical. In the case of gravitational 
energy we can learn nothing from these experiments. The only other known 
class of interactions is the class of weak interactions responsible for nuclear 
beta decay. The fundamental interaction responsible for P decay may be 
represented diagraramatically as 



with an associated matrix element 


where the Fermi constant g 



1.4 X 10-^® erg cm^ [3], 


(2,3.6) 


There is evidence in nuclear physics for a similar interaction between two 


nucleons 
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and the energy associated with this is given hy the diagonal matrix element 

g r 'i'*y av ~ I ( 2 , 3 . 7 ) 

^ J p p n n V '' ^ 

where V is the nuclear volume. The weak interaction energy of just one pair 
of nucleons in A^ is thus ^10 ^^ergs ^ 6 X MeV corresponding to 3 X 
10 “^^ of the inertial mass. 


If the total energy goes as the number of nucleon pairs, the fraction of the 
inertial mass contributed by the weak interactions is in all nuclei. 

If we suppose a V>/o imbalance in this fraction between A/t and Pt then T] 
would be 10“^° if the weak interaction energy had no gravitational mass. 
If the imbalance was due to one pair of nucleons in A^ , the corresponding 
r\ would be ^ 3 X 10“^^. Thus the Braginsky version of the Ebtvbs-Dicke 
experiment suggests ^ 10“^ - 10“^, but there is not much margin for 

error. 


In summary, the ratio of gravitational to inertial mass of binding energy is 
unity to within 3 parts in 10®. Potential energy and kinetic energy consi¬ 
dered separately have Y = 1 to a rather better accuracy. Little can at 
present be said about the gravitational properties of the weak interactions, 
and nothing about the gravitational properties of gravitational energy. 

Conservation of energy and the mass-energy relation of special relativity are 
verified in nuclear physics at a level of about one part in 10^. This has 
been done by comparing the masses of nuclei as determined through mass spec¬ 
troscopy with corresponding energy differences measured through reaction kine¬ 
matics. With the equivalence of gravitational and inertial mass of energy 
established at the level of a few parts in 10® and the inertial mass equiva¬ 
lent verified at one part in 10"^ [s], it would indeed have been astonishing 

if the gravitational redshift experiment had failed to yield the expected 
answer at the l^/o level. 


We may now confidently accept the non-relativistic relation 

F = G 
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"wherG and arG inGrlial massGs and aUGn^I 10 find IIig rGlafivisfic 

la'ws of gravitation. Our position is analogous to that of a martian physicist 
who has accGss to naithar matals nor lodastona , hut has nonGtholess studiad 
GlGctrostatics and discovarad tha Lorantz transformations of spacial ralativity. 
Bafora attampting to find tha ralativistic laws of gravitation, wa will first 
attampt this martian task of finding the full laws of alactrodynamics from 
Coulomb’s law and special relativity. 
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CHAPTER 3 

MARTIAN ELECTRODYNAMICS 


3.1 Fields 

We imagine a martian physicist who in the dry atmosphere of Mars 

has acquired an excellent knowledge of electrostatics. Without having dis¬ 
covered electromagnetism, he nonetheless decides to attempt to measure the 
velocity of Mars through the hypothetical medium which supports light waves, 
obtains a null result with interferometers with arms of both equal and unequal 
length and is driven to discover the Lorentz transformations. He finds a 
principle of relativity philosophically attractive and so embarks on the task 
of finding a set of physical laws which are covariant under the Lorentz trans¬ 
formations and reduce in the low velocity limit to the laws of electrostatics. 
For the rest of this chapter we shall follow the hypothetical reasoning of 
this hypothetical martian. 

Abstracting from Coulomb's law we define the electric field of a point charge 
q to be given by 



(3.1.1) 


The surface integral of E is 





(3.1.2) 


if q is inside the closed surface (and zero if it is not). If many point 
charges are present we use the principle of superposition of electric fields 


to write 


1 E . n dS = 4tt / q. 

■j — — L—. 1 

S inside 


(3.1.3) 


inside 


and defining the local charge density as 


P= — 

V 0 



(3.1.4) 


we have 


E . n dS = 4tt J p diV 


S 


V 


where V is the volume bounded by the closed surface S . Now 
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J'E.ndS = J'_V.EclV = 4tt f p dV 
S V V 

and since the volume considered is arbitrary we have 

_7 . E = 4tt P . 

With the further definition 


E = - 7cp 

where cp is a scalar function of position, we reach Poisson's 


(3.1.6) 


equation 


v^cp = - 4r\p (3.1.6) 

and this is a convenient form of the laws of electrostatics from which to start 
the construction of electrodynamics. 


This equation is clearly not covariant under the Lorentz transformations, 
because the operator contains only derivatives with respect to x , y and 

z . We need to replace 7^ by an operator with well defined properties under 
the Lorentz transformation, in which x , y, z and ict all enter symmetrically 
and which in the static limit reduces to 7^ . We therefore replace the 


Laplacian operator 7 


by the D'Alembertian 


1_I ~ ^ ^ P + O — -o" = 7 

and this will give us an equation 

Q cp = - 4np . 


— ^ (1.1.21);(3.1.7) 




(3.1.8) 


In a region of space where p is zero, we have 


which has solutions 
where 


V^cp - ^ = 0 

^ c2 


cp = f (k . X - (jot) 
k2 _ Uji 

k - ^2 • 


(3.1.9) 


The replacement of by the appropriate operator in which x , y , z and ict 
enter symmetrically has at once yielded us an equation with a free field solu¬ 
tion which propagates as a wave at the speed of light. 


Our hypothetical martian would undoubtedly be tempted to identify his theoreti¬ 
cally discovered electric waves with light, but at once he discovers a diffi¬ 
culty. The electric field associated with the potential o is 

E = - 7cp = -k f' . 


For example, if 
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i (k . X - OJt) 

cp = cp^e 

then i(k . X - cut) 

E=-kcpe^-- 

-- — 

This electric field is in the direction of propagation of the wave and there 
is only one polarisation. If our martian is acquainted with optics he will 
know that light has two internal degrees of freedom, two independent polari¬ 
sations. 


We therefore examine more critically the content of the equation 

Q cp =- 4ttP . 

Since the D'Alemhertian is a four-scalar operator, the whole equation is only 
covariant under the Lorentz transformations if cp and p have the same trans¬ 
formation properties. If cp is a four—scalar, then p must he a four—scalar. 
The charge of a particle is 

q = J p dV = J p dx dy dz . 


If such a particle is moving past the observer who uses coordinates x' , t' , 
then he sees a charge 

q' = J p' dx' dy' dz . 

If p is a four-scalar, then p^=p. 



We have Ay^ = Ay, 
Ax' = Ax ^ 


the Lorentz contraction. 

7 ^' 



= A z but 


Thus 


Fig. 3.1.1 Lorentz contraction 
of a moving charge distribution 


if p is a four-scalar, charge is a 
function of velocity and tends to 
zero as v c. If our martian has 
already done experiments with high 
energy electrons (goodness knows how) 
he will reject this possibility: if he has not then he has an addition to his 
list of crucial experiments to be performed when technically feasible. 

If we reject the identification of cp and p with four-scalars, we may consi¬ 
der the possibility that 

□ cp =-4TTp 


relates the same components of two four-vectors. If p is the fourth component 
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of a four-vector it mil transform like an interval of time. Since 


At' = At (l 


- — 

.2/ 


2\-i 




T2\-i 


and the charge density increases so as to compensate for the Lorentz contrac¬ 
tion of a charge distribution and leave the charge invariant, 


= J p' dx' dy' dz' = j j 


1 - 




(3.1.10) 


This is an attractive possibility and becomes more attractive if we allow our 
martian knowledge of conservation of charge. Conservation of charge plus the 
general statement that the laws of physics are the same for observers in all 
inertial frames requires local conservation of charge: we may not have charge 
disappearing at one point in space and simultaneously appearing at another 
point, because different observers do not agree on the simultaneity or other¬ 
wise of events at different coordinates in space. Thus charge only disappears 
from a volume of space by flowing in a continuous way through the bounding sur¬ 
face. Local conservation of charge is contained in the equation of continuity 


7. J 


If 

3t 


= 0 


where J is current density, J = pv . 


(3.1.11) 


Now ict is the fourth component of a four-vector. If we identify (j , icp) 
as the components of a four-vector, the equation of continuity can be written 
in the manifestly covariant form 

^ = 0 (3.1.12) 

where 

= (i » ic p) , = (x , ict) 

and is shorthand for 

Ih ^ ^ 

dx^ SXg ''' dx^ 

in the convention of summing repeated (dummy) indices. The equation of con¬ 
tinuity thus takes the form of a statement that the four-divergence of the four- 
current is zero and we have a relation valid for all observers in inertial 
frames. With the charge density the fourth component of a four-vector, we have 
charge an invariant and locally conserved. 

For this vector theory of electricity and related phenomena we write 
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Ajj = (A, icp) 


(3.1.13) 


and the equation for Cp is recovered on setting = 4 . There are now four 

internal degrees of freedom in the field, hut only two polarisations for light. 
These four degrees of freedom are reduced to three on noting that since 
and 


we must also have 




= 0 


(3.1.14) 


thus providing one constraint. We still have one more degree of freedom than 
obtains for light however, but the gauge invariance of the equations decouples 
one of these degrees of freedom: we shall leave the discussion of this until 
Chapter 8. 


3.2 Forces 

The next task confronting our martian is an investigation of the forces 

acting on charged particles: so far he has only obtained the potentials. In 

his experimentally accessible electrostatics he has for the energy density of 

a charge distribution in an external potential (p the three-scalar quantity 

£ = pep (3.2.1) 

and a force density given by , ^ 

5=-pVcp. (3.2.2) 

He therefore expects the forces in the general case to involve the four-scalar 
function 4 ^ . 

corresponding to a current-current interaction between two charge distribu¬ 
tions 




(3.2.4) 


and the derivative operator Since the force on a pointlike particle 


in the static case is 


F = 


% 

dt 


= - cp 


in the general case we must have 


^ 1 T OA 

— = J VA 
dt c |i— ^JL 


(3.2.5) 

(3.2.6) 


plus perhaps other terms, where 
j = r p 


J = qv 


icq 
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This equation is certainly not written in a covariant way: we have three- 
vectors on both sides and differentiation with respect to t . We may start 
to write it in a covariant way by replacing the p and V three-vectors with 
four-vectors: 




Now 


. E^ 


■TT^ = — J,, A,, + other terms 

dt c |i ox |i 


(3.2.7) 


P^ = \^P , i“y‘ SO the generalised force equation yields the rate of change 
of energy as well as the rate of change of momentum : 


so that when v = 0, — = 0 , 

— dt 


dE ^ 

at = - • - at = I 

dE 


(1.1.30) 


(3.2.8) 


Since J = (qv, icq), only is non-zero when v = 0. So we have 

‘^ 4 , 

C Sx. 


— 

|v = o 


J4 3A4 

= "TT — + other terms = 0 


(3.2.9) 


This is achieved by setting 


1 T iV 

dt c lildx dx 
^ I V 1- 

which is identically zero for |i = v . 


(3.2.10) 


We still have the problem of a differentiation with respect to t alone on the 
left hand side, and we may be worried because = J is ^ that 

is the four-vector. We can however rewrite this equation in a manifestly co¬ 
variant way : 


so that 


^Pv 

1 

1 

Ic 

dT 


at 

dT 

dt 


T 



dT 

d 

= 1 TT ^ 

■ ^ dT 

dt 

dp 

dT 

-i 

c dT 1 

'iv 



(3.2.11) 


This equation is clearly covariant, for the charge of the particle q is an 
invariant, the four-momentum of the particle, p^, is a four-vector and so is its 
derivative with respect to the scalar quantity t, the proper time measured in 
the instantaneous rest frame of the particle. The derivative of the four- 
coordinate X of the particle, dx /dj , is also a four-vector, the four- 
velocity, and so the right-hand side is a four-vector fonned by taking two four- 
vectors and a four-vector operator. The quantity 
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BA. 


BA 


is frequently denoted by F. 

[ 1 ]. 


vp. 


, _V 

B X " B X,, 

V p 

and is called the electromagnetic field tensor 


¥e can obtain the force laws in a familiar form by working out the space-like 

and time-like components of equation (3.2.10), obtaining 

dp r 1 BA I ^ 

^=q i-7cp--^+-vX 7XA 
dt ^ I cBt c— — 

(3.2.12) 

1 


dE r _ 1 

— = qv. 1- Vcp- J 

The term in V X A provides a force always at right angles to the velocity 
and so does not affect the energy of a charged particle: this is the magnetic 
term. With the definitions 

B = 7 X A 


r 1 BA-^, 
E =-tVcp + - 


we have the familiar expressions 


q V . E 


dE 

dt 


(3.2.13) 


(3.2.14) 


We may calculate the acceleration of a particle by using the relation between 
momentum and energy 


E 

p = rav = V —j 
— c 


which gives 


so that 


Ov 

“,dt 


dt 


dv q r V . E 1 

^ = -i E-v ==-3= + 7 V 
dt m L— — c — 


V . E 


dE 

dt 

X B 


(3.2.15) 

(3.2.16) 


where 


E 


n 


' A V 

V1 - — 

C 

Thus by searching for a set of relativistically covariant laws that reduce in 
the low velocity limit to the familiar laws of electrostatics, our hypothetical 
martian has uncovered the laws of electrodynamics. We will parallel this 
reasoning in Chapters 4 and 5 in an attempt to find the laws of gravitation of 
which Newtonian gravitation represents only the low velocity limit. 


) 






44 


GRAVITATION AND RELATIVITY 


3.3 The Lagrangian formalism; for experts 

The four-scalar interaction — ^ A has the dimensions of energy den- 
sity hut does not transform as an energy density. It can however he inter¬ 
preted as an invariant Lagrangian density corresponding to the interaction of 
the current density with the external field A^. Integration over the 

space variables provides, for a point-like particle, the interaction Lagrangian 


^ %A 

c dT ji 


(3.3.1) 

The equations (3.2.11) may he obtained at once from the invariant Lagrangian 

1 


^ = 2“o'df '57+1 if \ 

inserted in the covariant version of the Euler-Lagrange equations [2] 


(3.3.2) 


dT 


-ik 


dL 

3x 


= 0 


(3.3.3) 


where 


dx 

dT 
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CHAPTER 4 

RELATIVISTIC GRAVITATIONAL FIELDS 


4.1 The gravitational Poisson equation 

We first obtain the gravitational version of Poisson’s equation. 
Abstracting from Nekton's la'W of gravity, "we define the gravitational force 
field due to a point mass M as 

H(r) = - GM ^ . (4.1.1) 

The surface integral of this force field H is 

Jh . n dS = - 4 ttGM . (4.1.2) 


To obtain the equivalent of Poisson’s equation, we take a volume AV contain¬ 
ing a total mass M composed of a number of smaller masses each generating 
a field 



r H. . ndS =- 4TrGM. . 
J —1 — 1 


(4.1.3) 

(4.1.4) 


If H = EH. then 


J H . n dS = - 4 ttG SM^ 


(4.1.5) 


but it is important to note the assumption that gravitational fields obey the 
principle of superposition, that is, they add vectorially. This is certainly 
justified in a weak field approximation, but we must beware when considering 
strong fields. 


We define the mass density 


P 


as 


P = 


lim 

AV-0 


ZM. 
_ 1 

AV ’ 


when using Gauss's theorem we have 

J H. n dS = J V.HdV = -4nG J p dV (4.1.6) 

or 

V. H = - 4ttGp . (4.1.7) 

We now define a gravitational scalar potential h so that H =-Vh whence 

V^h = 4TrGp (4.1.8) 

which is the gravitational equivalent of Poisson’s equation. 
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In order to construct a covariant version of this equation we first make the 
replacement 


n 

Q h = 4TrGp , 


(4.1.9) 


In empty space we have 


(v2 _ h = 0 

c2 dt^/' 


(4.1.10) 


which has solutions which are waves travelling with velocity c ; we are at 
once led to anticipat e the existence of gravitational waves propagating with 
i^elocity c . 


4.2 The properties of the source of gravitational fields 

Since Q is a scalar operator, the gravitational potential h must 
have the same transformation properties as the mass density p: to find out 
more about the gravitational potentials we must study the properties of p. 

We happily have more experimental information than our hypothetical martian 
inventing electrodynamics ; we know that particles of velocity c (light) are 
deflected in a gravitational field. 

Suppose that the gravitational potential h is a four-scalar. Then p would 
be a four-scalar. However, if this were the case we would have 

r p I "P I - p 

M'= I p' dx' dy' dz' = ! p / 1-^ dx dy dz = / 1-^ M (4.2.1) 

and the gravitational mass would tend to zero as v c and a particle of 
velocity c would not be deflected by a gravitational field. 


Could h be the fourth component of a four-vector h , as the scalar electro- 

M- 

magnetic potential is the fourth component of A^? Such an identification 
requires p to be the fourth component of a four-vector and hence M to be 
an invariant. The inertial mass of a particle is not an invariant. The 
proper mass is an invariant, but for a photon the proper mass is zero and so 
again light would not be deflected by a gravitational field. Indeed a parti¬ 
cle of arbitrary proper mass m^ would not be deflected as its velocity ->c, 

since the gravitational force would be F oc m while the momentum 

o 

moV 

p = mv = 

4-^ 

c^ 


r F dt 
" J p 


so that 


P 
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as V c : an electron is deflected through ever smaller angles in the 
Coulomb field of a nucleus as v -> c , 


¥e must consider the properties of the mass density p more carefully, and 
turn once more for help to the conservation laws. We have the equivalence 
of inertial mass and energy and we have conservation of energy, Eelativistic 
covariance implies local conservation of energy \diich we express through an 
equation of continuity : 

7. +11=0 (4.2.2) 


where = v£ and £ is energy density. We must beware of making the iden¬ 
tification of the quantities , ic^) with a four-vector, because we know 

that energy and momentum make up a four-vector. Each conqponent of momentum 
is also conserved, and if P denotes the momentum density, then for the x 
component of momentum we have 


7.(vlP) + ^ P = 0 

— x' at X 

and for the i^^ component of momentum 

BP. 

I- (vPi) +-§r = 0 


which we may write as 


^ biP. 

^r.P.+ = 0 

dx. j 1 at 
J 


(4.2.3) 


(4.2.4) 


We can combine these equations with the equation for energy conservation and 
write (in the absence of external forces) 


where 

Since 




Bx 


= 0 


is the energy-momentum density tensor. 


P = V “2 (= V p) we can write out ^ explicitly. 


r 


3 : = e 

l-LV 


V V 
X X 


V V 
X y 


V V 
X z 


c 

i V 


V 


V V 

■ 7. I 

^2 


V V 

c" 

i v^ 
c 


V V 
Z X 


V V 
z 3^ 


V V 

z z 


1 V 

X 

c 

i V 


c 

i V 


3 


- 1 


J 


(4.2.5) 


(4.2.6) 
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or setting (v , ic) = v 


liv 


V V 

= e 


(4.2.7) 


This is only a shorthand however, v^ as defined here is not a four-vector, 

because v, = dx, ,/dt: the corresponding four-vector is u = dx /dr. 

la ^ ^ ® li [i' ' 


A tensor t 




has the transformation 
. f 

Vv 


lip va pa 


(the Kronecker delta function 6 is a tensor) so that 3^!^ = a^ a, 3^ 

liV ’ 44 4p 4a pa 

if 3^^^ transforms as a tensor. If the unprimed frame is the rest frame. 


where only 3^ is non-zero, then 


Then 


^44 ^44 ^44 ^44 


"44 


!' = - J3^dx'dy'dz'=- J 


44 


(4.2,8) 




dx dy dz 


1 


,,C2 


dx dy dz 




(4,2.9) 


c c" 

The energy of a particle indeed transforms like the fourth component of a 

four-vector, and the quantity 3 transforms as a tensor. 

jiV 

4.3 Possible forms of relativistic gravitational fields 

We might therefore guess that correct source for the gravi¬ 

tational field and write 


□ h 


^v 


k 3 


liv 


(4.3.1) 


with h having sixteen components. Since 3 is synmetric, h will 

H” V |jlV 

be synmetric, reducing the number of independent components to ten. The con¬ 
servation laws further reduce the number of independent components : since 


d 3. 




we have 


Bx. 




"dx 


= 0 


= 0 


(4.3.2) 

(4.3.3) 


leaving six independent components. 
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Compare 


with 


□ h 


lav 


k3:. 


|iV 


We must identify 
write 


□ h 

with 


4ttG 


4TrG p = 4TrG —^ ^ 


44 


"44 


and hence the constant k with 4TrG/c^ and 

(4.3.4) 


l—' 1J.V 14V 


SOTirCG 

The four-scalar interaction^ between the field generated by ^ ® 

test matter distribution, which reduces to -hp in the low velocity limit is 
(compare the electromagnetic interaction 


^p.v 


which for a small test particle of mass m becomes, on integrating over 
volume , 


4-l3il.V 


= - h 


V V 

V 


(4.3.5) 


c- 

Suppose we choose to be at rest in the frame of the source of the gravitational 
field. Then 

□ 4ttG source 

‘*44 = --^ ^44 

and because we are at rest in the frame of the source, all other con 5 )onents of 

2 *source zero if the source has no internal motions. These components 

44 

being zero, they can generate no field and a single source at rest with res¬ 
pect to the observer only generates an h^ component of the gravitational 


potential. For a time independent field 


^AA = - 

44 


4ttG 


2 ^44 


source 


and the solution of this equation gives just the Newtonian potential, 
interaction with a test matter distribution is for this case 


The 


,2 


^AA ^AA 

44 44 


c c^ 

which is just the Newtonian interaction. Equations (4.3.4) and (4.3.5) thus 
lead to the Newtonian deflection of light by the sun - we have a problem here, 
for in this particular case our elaborate relativistic theory gives results in 
agreement with the naive reasoning in Chapter 1 , and in disagreement with 
experiment. 
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Nonetheless, before attempting to resolve this discrepancy it is i^orth noting 
that effects other than Newtonian come in when elements other than the 44 
elements of both h^^ and are non-zero. Just as in electromagnetism, 

we have differences from the Newtonian case if both source and observed parti¬ 
cle are moving with respect to the observer. Even if the centre of mass of 

the source is at rest, a spinning source will generate extra terms in h 
giving rise to both a spin-orbit interaction and, if the test particle is also 
spinning, a spin-spin interaction. This is why the measurement of the preces¬ 
sion of an orbiting gyroscope is of interest [l]. 


We must now examine whether there are any other forms for the source of the 
gravitational field that reduce to £/c^ in the low velocity limit. We 
raised earlier the possibility of a scalar field which would have to couple 

can we construct such a field using J as the source of 


to a four-scalar 
gravitation ? 


\1V 


We have already seen that 

so that 


^44 = «44 844 ^44 


^(rest frame) 


1 

^ c^ 


The quantity /c^) is thus an invariant, the proper energy density, 

and —(1-^ /c^) is the proper mass density. Then 


E = |e dxaydz = J - ^ -1 ^ - ® - ®^^ . ®°-®) dxdydz 
P £ (rest frame) 


V 
c2 


A - 4 

V c^ 

E (rest frame) 


(dx dy dz) 


rest frame 


(which just repeats the work leading to equation (4.2.9)). 


The four-scalar £(!-'''’ /c^) can be formed from J 


^JIV: 


&(1 - ^ 7 c 2 ) = - J 




(4.3.6) 


where the quantity J is the sum of the diagonal elements of J 

14 i ^ jiv ’ 

the trace of J 

UV 


called 
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We can thus envisage t\tfo forms of scalar interaction het^een a source density 
3 : source ^ second tensor 'S , \vhich reduce to the non-relativistic 

pv pv 

Neivtonian limit. The first gives no results other than Ne\ytonian when the 
observer is at rest in one or other of the frames of the two masses and is of 

j source 

pv pv 

a four-scalar product of two tensors, while the second is of form 

,source 


S' S" 
pp vv 


and is the product of two scalars. 


This second term may he written as the product of two tensors, for the 
Kronecker delta function is a tensor, 

S 6 

pv pv (JO 


and we identify a second kind of possible gravitational field which has a 

-k N 

scalar source . We write the ’Newtonian’ field h ... and the scalar field 

S 


pv 


h*^ and have 

pv 


.N 


pv N pv 


and I I h^ = k^ 

'—' pv S 

which is the equation for a scalar field, disguised as a tensor. 


s pv oa 


(4.3.7) 

(4.3.8) 


If we are in the rest frame of a source with no internal motions, we have only 
^44 non-zero, while 1 ^22’ ^33 ^44 non-zero. The scalar 

field yields an interaction which contains a piece dependent on the value of 
vVc^ for a particle measured with respect to the rest frame of the source, 
and of course 

(4.3.9) 


3: h® 
pv pv 


3 3® ■ 

14i otr 


1 (? 

1 - £ 3 


44 


in the source frame, so that light is not deflected in the scalar theory. Note 
also that the scalar field has no off-diagonal terms. 


We may make a more general theory in which we form the gravitational potential 

N S 

h out of any suitable combination of h and h . In the low velocity 

pv . 

limit we must recover Newtonian gravity, while in the high velocity limit we 
want twice the Newtonian interaction. Both these features are achieved by 
setting 


*This is a directly coupling scalar field. Very different properties 
may be obtained for an indirectly coupling scalar field [2,3] . 
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□ =-^ 


i 3: -4-6 

I liv 


l-LV (JO 


(4.3.10) 


where 


(Note that S' 


^—, 8 n& - 

I h = — Q Si^ 

^ MV MV 


3 : = 3 : - 


aa 


MV MV 
is a scalar and 6 


MV 


" MV 00 
is a tensor.) 


These equations express TEIE tensor theory of gravity, which is distinguished 
from other theories with tensor fields by a very special characteristic. The 
field h^^ defined by Eq. (4.3.10) is the classical version of the field 
which in quantum theory is mediated by the exchange of mass-less, spin 2 parti¬ 
cles (gravitons) in much the same way as the vector electromagnetic field is 
mediated by the exchange of mass-less spin 1 particles, photons. A gravita¬ 
tional wave generated by 3^^^ has only two polarisations : this is discussed 
further in Chapter 7. From now on, when we refer to the tensor theory, we 
shall mean the theory embodied in Eq. (4.3.10). The scalar theory is embodied 
in Eq. (4.3.8) although disguised as a tensor field, and the 'Newtonian' 
theory of Eq. (4.3.7) is part tensor and part scalar : we may loosely call it 
a half tensor theory. Eq. (4.3.10) can be written 


□ h 


MV 


8ttG j ' 
^2 I' 


+ 4 6 


Pv 


v^ « 


(4.3.11) 


A static field in this tensor theory is thus given by 

V. v 


pv 


%4 6 , 


MV ^ 




(4.3.12) 


and if v. 


Vg , V3 = 0 


^ h.. = 

44 


V4 = 1C 


then 


h 


11 


8nG Ji 


(4.3.13) 


and similarly for ^221 ^ 33 * 


Solving for the fields of a spherically symme¬ 
tric source of mass M we therefore have 


, GM 

hy,. = “ - 

44 r 


^11 


- ^22 - '*33 “ r 


The static fields of a mass M can be written for the three cases as 
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r 


UV 


V 

f 


GM 

r 

0 

0 

0 


r 


P-V 




V 


0 


0 




-iiii 0 


0 

0 

0 

0 


r 


Scalar (4.3.14) 


0 - 


ilili 0 


GM 

r 


0 


0 

m 

V 
0 


Newtonian 
(or 'half (4.3.15) 
tensor') 


r 


^ 0 


0 - — 


Tensor (4.3.16) 


y 


in the rest frame of a spherically symmetric source. In the scalar an(3 tensor 
cases a fast particle has a gravitational interaction depending on its momen¬ 
tum as well as its energy. In the scalar case this decouples a fast particle, 

in the tensor case it yields twice the interaction: if we write the interac¬ 


tion with a particle as 
(4.3.14) - (4.3.16) : 


-T h /c^ Ttfe have using the fields of Eqs. 


liv pv 

T h® 

pv uv 

^2 
,N 


/ GMm 


T h 

uv uv 

" c" " 


GMm 

r 


^Uv'^UV [a GMm 


(4.3.17) 

(4.3.18) 

(4.3.19) 


The forces will certainly involve the gradient of these quantities : we expect 
that the tensor theory will give twice the deflection of light obtaining in 
our original Newtonian construction. 
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Thus with the tensor theory we have Lorentz covariant equations connecting the 
source and the fields, the Newtonian interaction in the low velocity limit and 
twice the Newtonian interaction for fast particles. ¥e still need to work out 
in detail the force laws however, and check that this theory yields the proper 
gravitational redshift for photons despite the extra term in the interaction 
which seems likely to yield twice the light deflection expected naively. A 
further problem is whether despite the additional deflection of light we can 
still maintain that light travels in straight lines in a freely falling box. 
The next two chapters are devoted to these questions. ¥e must end this chap¬ 
ter by pointing out a probable deficiency in our treatment of gravitation. 


4.4 Non-linearity of the field equations 

¥e have set up a theory in which the source of the gravitational field 
is the energy-momentum density. However, the energy in the gravitational 
field is itself expected to be a part of this source: strictly speaking we 
should write 


□h, 


liv 


1 , (gravitational) + (everything else) } (4.4.1) 

|mIV LL\^ j 


pv 


and we expect (gravitation) to be composed of bilinear products (’squares') 

of the derivatives of h itself. Thus for complete consistency we should 

H'V 

have a non-linear differential equation for h^^ and the principle of super¬ 
position would not hold. This should not be important for weak fields 
however, and we are mostly concerned with weak fields. Indeed, the linearised 
(weak field) equations of general relativity can be written in the form 
(4.3.10). 


This kind of problem does not arise in electromagnetism because charge is the 
source of the electromagnetic field and the electromagnetic field being 
neutral (photons are uncharged) it does not contribute to its own source. 


There are two obvious concrete examples of the difficulties with the linear¬ 
ised theory. First, we expect gravitational waves to be deflected by a 
gravitational field in the same way as light is deflected. Gravity must 
therefore be a source of gravity. Secondly, the gravitational energy of a 
mass M, radius R is - GM^/R so that the total energy is 


which goes negative for a 


Mc^ 


GM ^ 

2y 


Rc 


finite value of M/ R . 


However, the increasingly 
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negative gravitational energy reduces the gravitational mass and if even for 
gravitational energy the equivalence het'ween inertial mass and gravitational 
mass is maintained the total energy can remain positive. 
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CHAPTER 5 


RELATIVISTIC GRAVITATIONAL FORCES 


5.1 The velocity of light 

Using the energy-momentum density as the source of relativistic gravi¬ 
tational fields, we have picked out the tensor field with source 
this case the interaction of a spherically symmetric field due to a source M 
with a test particle of mass m is given hy 




i 1 + 


C-^ -|iv ^ r 

which reduces in the non-relativistic limit to the familiar Newtonian result. 
The gradient gives the Newtonian force for v/c « 1 and twice the Newtonian 
force for v/c 1: we may reasonably guess that this form will indeed yield 
twice the Newtonian deflection for a fast particle - or a light pulse. Before 
attempting to construct the force laws, it is extremely instructive to examine 
a necessary consequence of the gravitational deflection of light; the local 
velocity of light must depend on the local gravitational potential. 


GMm 


(4.3. ^ 9 ) ( 5 . 1 . 1 ) 


We have so far considered the forces acting on relativistic particles, or 


localised pulses of light. If we consider instead an extended wavefront. 



Fig.5.1.1 The normal to a plane of 
constant phase is rotated through 
an angle A a in distance Ax as a 
consequence of the phase velocity 
varying with y 


deflection of light corresponds to a 
plane of constant phase, (which is 
normal to the direction of propagation) 
being turned through an angle in pass¬ 
ing through the gravitational field. 
This is achieved by a reduction in the 
phase velocity of the wave as the 
gravitational field gets stronger 
(Fig.5. 1 . 1 ); light passing close to 
the limb of the Sun will travel more 
slowly on average than light far from 
the Sun . 


The gravitational field thus plays the role of a refractive medium and we can 


calculate an effective refractive index. We set 


Aa = 


v(y + Ay) - v(y) 
Ay 


At 


A. Ax n . 
At = — = — Ax 
V c 
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and then 


A ^ 1 dn * 

Aa = - ~ — Ax 
n dy 


( 6 . 1 . 2 ) 


This may be identified with the analogous equation obtained from thinking in 
terms of a relativistic particle ; 


Aa = - 




dx 


\ 




GM £• X . „ GM y . ^ 

^3^.2 y - 2 ^2^2 r (5.1.3) 


where we are assuming the normal component of the force to be given through 
the gradient of the tensor interaction. (Fig. 5.1.2 and Chapter 1, section 5.) 



Fig. 5.1.2 

¥e may then make the identification 


i ^ _ 2 I 

n dy ^2^2 r 


(6.1.4) 


Since the deflection of a relativistic particle given by the gradient of 
Eq. (5.1.1) is independent of its energy we expect a photon to be deflected 
by an amount independent of its frequency. If a pulse of light is deflected 
by an amount independent o*f its frequency spectrum, then the gravitational 
field must be non-dispersive: the effective refractive index must not depend 
on frequency. 


Integrating (5.1.4) in the weak field approximation we find that 


n(y) - n(«>) = J 2 ^ 


-^3 dy = 2 
(x2 + y2)3 rc2 


(5.1.5) 


As r -> 00 want the velocity of light to be c , n 1 and so with 


n = 1 + 2-^ (5.1.6) 

rc 

we achieve consistency between a plane wave picture of light and a pulse (or 
photon) picture. We can see at once that our full force laws cannot possibly 
be given by the gradient of (5.1.1) because here we have light slowing down as 
it enters a gravitational field, despite the tendency of the gradient term to 
accelerate a particle. 


We may at this stage begin to suspect that constructing a relativistic theory 
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of gravity will not be as easy as it was beginning to seem, for the constancy 
of the velocity of light is fundamental in special relativity. 

5.2 Radar ranging in the solar system 

It should now be clear that a necessary consequence of the gravita¬ 
tional deflection of electromagnetic waves is a reduction in speed of light in 
the deflecting field. This is observable and constitutes a further experi¬ 
mental test of relativistic gravitation. 


If radar is bounced off a planet when it is near superior conjunction, so that 
the line of sight passes close to the limb of the Sun, there will be an excess 
delay introduced in the time for the round trip. The change in path length 
due to deflection in the Sun’s gravitational field is second order in the 
small quantity , and so we can compute the time to first order by 

considering a straight line trajectory (Fig. 5.2.1); 



The excess delay is the second term. We evaluate it in pieces. The excess 
delay introduced in going between Earth and the point of closest approach is 


2GM 




r / ? p 

tn^^x + A,/ x'^ + b j 




Vr^ - b^ 


and in going between the point of closest approach to the Sun and the planet 




r^tn -'J^x + x^ + b^ j J 




The one-way journey between Earth and the target planet thus introduces an 


excess delay 


2GM^ 


--tn 




and the total excess delay in the round trip is 
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4GM 

= “73" 


•r + -,pr + - b^-, 

" _ ir-E—E—j 

JL b J 


r J. — u "‘if—*- 

-to I — iF-E—-a 


with b « r , r we have 
P e 


and 


4GM r- r r 
t„ = -^ to [4 I 


E - c3 


,2 -1 


8®*© 




be' 


(5.2.2) 

( 5 . 2 . 3 ) 

f 

( 5 . 2 . 4 ) 


If we take Mercury as the target planet with 


and 

we find 


r = 1.495 X 10^^ cm- 
e * 

r = 0.565 X 10^^ cm , 
P ’ 

b = E© = 7 X 10^° cm , 

Mq = 2 X 10^^ gm , 

tg = 220 |jsec 


in a total time of 


(r + r ) 

2 — -^ « 22 mins- 


(note that 100 jjsec is the time taken for light to travel only 30km) , and 



5.7X10“"^® s cm-^ . 


The planets Mercury, Venus and Mars and certain suitable space probes have 
been used as targets: currently the predictions of general relativity are 
verified at level [l]. We should note that a measurement of the coeffi¬ 

cient in ( 5 . 2 , 4 ) removes the need to know planetary orbits with great accuracy 
(the best way to determine them is by radar ranging) and also sweeps luider the 
rug subtleties relating the real earth based system of coordinates, immersed 
in the Sun’s gravitational field, to the idealised coordinates we have used, 
(See section 9.9.) 


As we pointed out in Chapter 1, the refraction of radio waves in the solar 
corona complicates measurement of both the gravitational deflection of radio 
waves and of the excess delay introduced by the solar gravitational field in 
radar ranging. We may now put in a few numbers. 


The equation of motion of a free electron in an electric field, neglecting 
any damping, is : 
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^ 

dt^ " " 


(5.2.5) 


and so for a fixed frequency U) 


eE 

2 

U) m 


and a single oscillating electron corresponds to an oscillating dipole moment 


e^E 

- ex = - — . 

m(jr 

If there are N electrons per unit volume, the dipole moment per unit volume 
is 


Ne^ E 


mi 


and the susceptibility of the gas of free electrons is 


X = 


The dielectric constant is thus 


e = 1 


Ne^ 

4nN 


and the refractive index 


muo^ 

r 

n =:/e« 1 - - 


mu)^ 


is to be compared with the refractive index due to gravity 

2GM^ 


n = 1 + Y 
h rc 


(5.2.6) 

(5.2.7) 


In a gravitational field the phase velocity is reduced and the medium is non— 
dispersive. In a cloud of electrons the phase velocity is increased but the 
medium is dispersive. A beam of radio waves is deflected away from the Sun by 
electrons in the corona, but a pulse travels with the group velocity in such 
a dispersive medium. 


V 

g 


c 



2ttN e^ \ 
muj^ 


We therefore want to compare the quantities 2G>^rc^ and 2TTNe^/mu)^ . If 
we take N = 5 X 10® (r/R^)”^ cm“^ , [2] , then the ratio 


2ttN e^ 


2GMq 


= 3.7 X 10® 




rc 


2 


which is approximately 0.5 for X = 10 cm , r = E . It should at once be clear 
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T^hy impact parameters smaller than a few solar radii have not been used in 
these studies: the deflection measurements of reference [3], at X = 3.7cm , 
employed no impact parameter less than 10 R^ , 


5.3 Further remarks on the variable velocity of light 

With our tensor theory we have deduced an effective refractive index 


n = 1 + 


2GM 


= 1 


2 cp 


rc c 

for a spherically symmetric field^ and found that light slows down as it 
enters a gravitational field, despite the effect of gradient terms increasing 
the momentum of a particle. We must search for extra force terms anyway in 
order to have covariant force laws, hut will do well to remember that a velo¬ 
city V can decrease while a momentum mv increases provided the mass m 
changes fast enough. (in special relativity with a constant force dv/dt->0 
as v->c because of the relativistic increase of inertial mass.) We have 
here a hint that mass may depend on the local gravitational potential, having 
already found that the velocity of light depends on the local potential. 


We may note another interesting result. The dimensionless quantity 2GM/rc^ 
measures the change in the refractive index of empty space in the neighbour¬ 
hood of a massive body such as the Sun. But all our experiments are done 
when we are already immersed in the gravitational field of all other bodies in 
the universe. The potential at the centre of a sphere of constant density 


R 


G 


4TTj0r^ 


dr = - 2 ttG pB^ ~ ^ 


and so the quantity 


2 cp 


universe 


4TrGp . 

universe universe 

-- 


If we take R 


universe 


sity of matter, 
then 


'universe 


as the Hubble radius, 
« 10 ‘ 


10^® cm and for the mean den- 


‘ ^^gm cm ^ (still a very uncertain number) 


2 cp 


universe 


10 “ 


which is a number perhaps surprisingly close to unity, considering the large 
(cosmological) numbers fed into the calculation. Here we have a hint that 
perhaps our local physics is determined by the properties of the distant 
parts of the universe [4] , through the long range gravitational interaction 
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'Which, unlike electromagnetic effects, cannot be cancelled out because there 
are no negative gravitational charges. (See section 10.5.) 


5.4 The force laws and equations of motion 

In order to determine the equation of motion of a test particle in a 
gravitational field, we must evaluate the force on a particle, not a force 
density, and one term must be the generalisation 


Then 


— T 7h^.->-m7cp 
^2 KiV- IJV 

dp . 

— = -4r T 7 h . 
dt 


in the Newtonian limit . 


other terms (5.4.1) 


and more generally the time derivative of the momentum-energy four-vector p^ 
will be given by 


+ other terms. (5.4.2) 


dt ^ MV anCp MV 

The other terms must presumably he of form 

-J- T — h 

^2 IJV pV ’ 

noting that the only other possible force field to couple to T^^ is 
which gives nothing new because T^^ and symmetric. Then we must 




VP 


have 


“ -2 ■‘'ijv Bx p,v 3x 

P V 


(6.4.3) 

dt c" “ ■■ 

and we have to determine the quantity a in order to establish the eq\iations 
of motion. 


Consider the spherically symmetric fields generated by a source M, the fields 
not being explicitly time dependent. Then 

GM 

^11 ^ ^22 ^ ^33 = “ ^44 = “ ^ - r 
and we can write, setting c=l for convenience 


dt 


I V ^ + cXmv V -T- 

MV 3Xp MV Bx^ 


(5.4.4) 


where 


V. = V. 
1 1 


dx. 
_1 

dt 


V4 = 1 


and the quantities v do not make up a four-vector. Then since dcp/dt= 0 

V 
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- m(l + v^) Vcp - am vv . Vcp (5.4.5a) 

^4 6E 

= i amv . Vcp , ~ = CXm V . Vcp . (5.4.5b) 

It is possible to write these equations in terms of a single scalar potential 
cp only because we are considering the particularly simple case of a spheric¬ 
ally symmetric source at rest in the reference frame. It must always be 
remembered that the gravitational potential is a (16 component) second rank 
tensor. 


The rate of change of kinetic energy is the scalar product of force and velo- 

^ _ — 
dt • dt • 

This is familiar in classical mechanics: it also holds in relativistic mecha¬ 
nics since E^ - p^ = m^ (in units with c=l) whence 


® dt “ £ • dt 


In order to satisfy this relation we must set a =-l when (5.4.5a) and (b) 
become 

— = - m(l+v^) Vcp + myy._Vcp (5.4.6a) 


— = - mv .Vcp . (5.4.6b) 


We may note that (5.4.6b) is unchanged in the Newtonian limit, while (5.4.6a) 
reduces to the Newtonian result as v 0. Rewriting (5.4.6a) as 


— = - mVcp - m 


V cp - V V . _V c 


we see that the second term can be written as 

mvX (v X _Vcp) 

and is always at right angles to the motion. Such a force does no work. 


The term in vv. Vcp has no component at right angles to the direction of 
motion. The only force term with a component at right angles to the direc¬ 
tion of motion is 

- m(l + v^) Vcp 

and so we do indeed get twice the Newtonian deflection of light, on applying 
the treatment given in section 1.5. 
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We may thus infer that (5.4.4) should he written 

^Pn 


dh ^ ^ Bh 

—= mv V — mv V ^ ^ 

dt V ax u V ax 

P ^ 


(5.4.7) 


for the case of a general gravitational field. However, while is a four- 

vector, it is differentiated with respect to time, and on the right hand side 
we have 


V V 
^ V 


dt dt 


Thus although this equation is attractive, we must see if it is indeed Lorentz 
covariant. We may write 

dp dp ^ dT 

dT dt * 


dt 


The proper time T is an invariant and so dpp/dT is a four-vector. 


With 


% 

dt 


dT dt 


we have 


dT % dx^ ( ah^ 


^ p QT , 

dT ~ ™ dt dT dT ^ d 


3x^ J 


The quantity m is the inertial mass of the particle and 


At = At (1 -v^)"2 


= n.(l-v2)2 = . 


.2\i- 


We can therefore write (5.4.7) in the form 


dx^dx^.ah^ 

= “ -- dT I bx 


dT 


B h 


PV \ 


o dT dT 

which is indeed manifestly covariant 


3x^ J 


(5.4.7a) 


We may now calculate the acceleration of a particle in a gravitational field. 
We have the equations (5.4.6) which give the rate of change of energy and 
momentum and must combine them noting that because mass increases with velo¬ 
city the rate of change of velocity will be less than the rate of change of 


momentum. Given 


p = Ev = mv 


and 

as V -> 1 . 


dv jL ^ V dE 

^ E dt " E dt “ 


) 7 cp + 2v V . 7(p 


V 


dv p 

-“=-v.7cp 


0 


(5.4.8) 


(5.4.9) 
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But in the previous section we found that a consistent description of the 
propagation of light in a gravitational field could only be obtained with 


or c = 1 + 2 cp with 
cp ^ 


c = — = c(l + 2cp) 
cp n \ 

:1 far away from the source. 

^ (c2) = 2c = 4 ^ 

dt cp' dt dt 


Then 


with c 


cp 


Since 




c 

-cp 


dc 

-ZCQ 

dt 


dcp 

dt =-2cp--2'P 


2 c . Vcp 


(5.4.10) 


and this result should also hold for a particle in the limit v -» c . Eqs 


(5.4.9) and (5.4.10) are thus inconsistent, 
used in extracting (5.4.9). 


cp 

We must examine the assumptions 


Equation (5.4.9) was constructed from Eq. (5.4.7), using p = Ev. Eq. (5.4.7) 
in turn depended on the identification cx= - 1 . This was obtained by insist¬ 


ing that 


dE dp 

E — = P . — 
dt - • dt 


which in turn depended on the relation 

E2 - p2c2 


m^ c'^ 
o 


In special relativity the relation p = Ev yields 

111 
E 


m c 
o 


j 


(5.4.11) 


1 -• 


This relation gives us the clue to the source of our inconsistency, for 
Eq. ( 5 . 4 . 11 ) implies v ^ c . In a gravitational field we find the local 
velocity of light is reduced, and we have no hope of finding the same laws 
of physics in all freely falling frames if particles can outrun light in a 
gravitational potential. If we substitute for (5.4.11) the relation 


E = 


m c 
o 


the maximum particle velocity is then 


2 2 
n^v'^ 


c 

c = —. 

cp n 


(5.4.12) 


Keeping 


E^ - p^ c^ = m^ c^ 
^ o 


we obtain (5.4.12) by setting 

£ = nEv = (l- 2cp) E V 


(5.4.13) 


and here we are departing from familiar special relativity. Then 
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dv (j^ dE 

nE— = — - vn — 

dt dt — dt 


E V 


dn 

dt 


so that to first order in the potential cp we have 

•— = - (1 + v^) Vcp+ 4vv.7cp ( 5 . 4 . 14 ) 

dv P 

v-Tif = - V. Vcp+ 3v^ v.Vcp 


which indeed gives the correct deceleration as v 1 and we have at least 
established consistency with Eq. (5.4.10), to first order in cp , and have the 


equations 


dp 

•— = - E(1 + v^) Vcp + Evv.Vcp 
dE 

— = - E V . 7 cp 

dt — — 


together with new kinematic equations 

p = n Ev E^ - p^ = 


which may be written 


m n V 
o — 


VlTTn^v^ 




(5.4.15) 


(5.4.16) 

(5.4.17) 


The second of Eqs. (5.4.17) shows that the quantity n m^ is playing the role 
of the inertial mass. The interpretation of these equations is however a 
little more complicated and it is instructive to consider the effect of the 
gravitational potential on the physics of a small region of space. 


5.5 The effects of local forces 

Consider a particle in a gravitational field which is acted on by some 
arbitrary force F , for example an electron in an atom. The equations of 
motion are 

- E(1 + v^) 7 cp + Ev V . 7cp + F ( 5 . 5 . 1 ) 

- Ev . Vcp + V . F . ( 5 . 5 . 2 ) 

If F = 0 then to first order in cp 

The quantity E(l + cp) is then conserved in free fall. Similarly, to first 
order in cp : 
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Evv. Vcp = -^ |p (1-cp)] 

since 

<5cp 3cp 

at = aT 

This suggests that when a force F is acting we should write 

^ |E (1 + cp) ] = V . F . 


(6,5.3) 


The quantity E(l + cp) is then acting as the energy in so far as the response 
to local fields is concerned. In the absence of local fields other than 
gravitation (that is, in free fall) this quantity is conserved, corresponding 
of course to the compensation of potential and kinetic energy. 


This identification tells us that if we lower a system gently on a string into 
a gravitational field, its energy goes as E E(l+cp) : this is the identifi¬ 
cation we made to calculate the redshift for Newtonian gravity. We therefore 
write the energy of a particle of velocity v in a potential cp as 

m (1 + cp) 

E^E =E(l + cp) = f== . (5.5.4) 

Similar reasoning suggests that we may also write 


^|p(l-cp)| = - E(1+v2) Vcp + P 

and if F is large in comparison with the gravitational force the quantity 
£cp = P(l - cp) would then act as the momentum of the particle: steadily lower¬ 
ing an atomic system into a gravitational potential changes its response to 
the force and we set 


(l-cp) nvm (l-3cp)m v 

P-P,, = P(l-cp) . " . 

^ Vl - n^^ ^ 1 _ n^^ 


(5.5.5) 


We may interpret this as giving us a rest mass which is dependent on the local 
gravitational potential 


m -^m =m(l-3cD) 

O 0(p o^ 


Since cp is negative, mass increases in a gravitational field and indeed a 
particle may be slowed down despite its increasing momentum . 


With these new definitions of energy, momentum and rest mass, we find 
“ocp c2(l + 4cp) = c2(l + cp) 
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to be compared with 
rest mass energy. 


E = E(1 + cp) : the quantity 
Further 


E 

cp 


2 2 

- V 

Cp cp 



and of course 


P = 

-tp 


E = 
cp 


Vl - vV 

cp 

“o 

CO cp 


l-vVc 



is acting as the 


(5.5.6) 

(5.5.7) 


(5.5.8) 


Thus all the kinematic formulae of special relativity emerge as being locally 
true in a region of space where cp does not change perceptibly. 


We can also write the free-fall equations as 


- E(l+v2)_ycp = - E^(l+v2) Vcp , 


to first order in cp ^hich with 




E V 

CD- 

cp 


yields again 


= - (1 + Vcp + 4vv.7cp 


where the last term comes from the variation of with the potential cp. 


The identification m = (l-3cp)m implies the existence of acceleration 
®cp ® 

dependent forces in a gravitational field, which we have not built in. we 
return to this point in section 6.5: we may wonder whether inertial mass is 
in fact due to the gravitational potential of the universe (Mach's principle) 
[4] . 

5.6 Gravitational deflection and gravitational redshift 

We have now reconciled the apparently discordant observations that the 
gravitational redshift is found to be in accord with elementary reasoning, 
while the gravitational deflection of light is twice the naive value. The 
force acting on a particle has a relativistic term of order (v/c)^, one 
piece of which acts in the direction of - Vcp and the other piece of which 
acts along the path of the particle. The gravitational deflection is due to 
that component of the force which is normal to the path and this is multiplied 
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by a factor (1+^), giving the required factor of 2 in the deflection of 
light. An unwanted factor in the energy is avoided because the two pieces 
of the relativistic correction taken together always act at right angles to 
the motion and so do not affect the energy. 

If we adopt a wave picture for light in our flat space coordinates we must 
argue in the following way. The gravitational potential acts as a dielec¬ 
tric, and in a time independent gravitational field the dielectric constant 
is not explicitly time dependent. Consequently the frequency is a constant. 
The frequency with which the light is emitted must therefore be shifted by 
(l + cp) , For a concrete example, suppose we lower an electron and positron 
into the potential. The rest energy of the pair changes by (l + cp). Setting 

hv = mc^ we see that the annihilation photons are shifted in energy and fre¬ 
quency by (l + cp) with respect to annihilation photons produced outside the 
potential, and do not change their total energy in falling freely out of the 
gravitational field. 

The frequency of radiation produced classically reflects the frequency of the 
oscillator. We must therefore find that the frequency of an oscillator in a 
gravitational field is reduced by the gravitational potential: for a consis¬ 
tent picture clocks run slower in a gravitational potential. In both the 
tensor and scalar theories we have discussed, the mechanism which arranges 
this also affects the lengths of measuring sticks : gravitational potentials 
distort the instruments we use in surveying reference frames. This is the 
subject of the next chapter. 
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CHAPTER 6 

THE DISTORTION OF REFERENCE FRAMES 


6.1 Introduction 

In the last chapter ive found that a gravitational potential changes the 
velocity of light so that 


c 




and also changes the rest energy of an object 

E - = E(1 + cp) 


the latter relation implying that the frequency of a periodic motion is 

changed by a gravitational field 

ui) a)(l + cp) 


and hence that clocks are slowed down. 


If we wish to elevate the principle of equivalence of inertial and gravita¬ 
tional mass (perhaps better called the principle of unique gravitational 
acceleration) to a principle which states that 

” In all localised freely falling frames the laws of physics are the 
same, Lorentz covariant and contain the same numerical constants ”, 


then we must find the same numerical value for the velocity of light in any 
freely falling local frame. Far away from the source of the gravitational 
field we use a clock to time light over a path in the laboratory measured 
with a measuring stick. The whole laboratory is now moved deep into the 
gravitational field and the measurement repeated. With respect to the refer¬ 
ence frame far away from the source, light has been slowed down, 


c -* c 


cp 


c(l + 2 cp) 


and the clock has been slowed down but by a smaller 
velocity of light .is determined by the ratio of the 
stick to the number of ticks N of the clock while 
one end to the other, N u) 


If 

then 


® = I + 'P) 

cp 

L - = L(1 + cp) 


N^ = ^(1.2cp) 


amount (i) -► a)(l + cp) . The 
length of the measuring 
light is travelling from 
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and the measuring stick is contracted so that a measurement of c^ using 
standard measuring sticks and clocks inmersed in the same field will yield 
the same numerical result for the velocity of light as an identical experi¬ 
ment performed far from the source of the potential cp . 


An alternative way of stating the same result is the following. We have 
inferred that atomic clocks will he slowed down, uuui)(l +cp). A clock 
could also he constructed by bouncing a pulse of light backwards and forwards 
between two mirrors of known separation. If the laws of physics are the same 
in any local freely falling frame, this clock must maintain synchronism with 
an atomic clock as the pair fall into a gravitational potential. Light is 
slowed down, c c (l +2cp) and so the light clock will be slowed down by this 
amount (^ + 2cp) unless the structure linking the mirrors shrinks. 

If L L (1 +cp) then + atomic and light clocks measure 

the same local elapsed time. 

We must therefore examine the physics of atoms in a gravitational potential. 

We expect to find that atomic frequencies and dimensions are both changed by 
a factor (1 + cp) . 


6.2 Atoms in gravitational fields; the change of scale 

We consider an atom deep in a potential cp and ignore the gravitational 
force (which we can always do by having the atom in free fall) and the tidal 
forces which for an atom should be negligible. We then have for the momen¬ 
tum of a particle of mass m 

® (1 - 3 cp) V 

and = (l-3cp)m. The change of mass alone is sufficient to change the 
atomic structure in the presence of a gravitational potential, but it cannot 
be the only effect. 


It is adequate for our purposes to use the simple Bohr model of the atom. The 
equations governing atomic structure are 



and the Bohr condition 



r 


E = i iitv^ 


m V r = h . 



r 


( 6 . 2 , 1 ) 

( 6 , 2 . 2 ) 


(if we were constructing a classical model, (6.2.2) would be replaced by 
mv r = constant, conservation of angular momentum.) 
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The characteristic dimension of the atom, the Bohr radius, is given by 


=- r 

me 


and 


me 

h2 


me 

h3 


(6.2.3) 

(6.2.4) 


We expect the electric field that holds the atom together to be modified by 
the gravitational potential, for the electromagnetic energy-momentum tensor 
is coupled to the potential h • The effect of this coupling on electro- 

jJiV 

magnetic waves could be represented by an effective refractive index n=l-2cp, 
We also know that the electrostatic energy of a system varies as (l + (t) from 
the EO'tvd's-Dicke experiments. The energy density in an electric field is 


eE^ 
8 TT 


and 


(6.2.5) 

( 6 . 2 . 6 ) 


V. (eE) = 4 ttp 
in classical electromagnetic theory. Then 

lEl ~ 

er^ 

and the energy £ stored in a system of dimension r is 

F 

er 

Such a system could be a capacitor, or perhaps an atomic nucleus. If we let 
e take up the effect of the gravitational potential on a given field then the 
effect of e together with the change in r caused by the gravitational 
potential must cumulatively change £ by a factor (l + cp) . If therefore the 
dimensions of a physical object vary as (1 + cp) , as we have guessed, then 

e = l-2(p. (6.2.7) 

Substitute this factor in the equations describing the atom by replacing e^ 
hy eVe in Eqs. (6.2.1) and find 


h^e 




We also must replace m by m(l- 3(p) to find the dimensions of the electron 
orbit in the potential (p and find at once 

rg - rg(l +cp) 

as we expected. We also have ^ ^4 

E , u) ^ (1 + (p) m e^ 

e 

and so have achieved an entirely consistent picture. 




74 


eiAVITATION AND RELATIVITY 


We may also note that with magnetic energy density given hy 

IJH^ 


( 6 . 2 . 8 ) 


Sti 


and 



Tvhere c is the velocity of light far from the source of the gravitational 
field, 



and the magnetic energy ][I\ of a system varies as 


e V 


(6.2.9) 


With r ~ (l + cp) and v~|~ (l + 2cp) , then (1-2 cp) if the magnetic 

energy of a real physical system is also to vary as (l + cp) . We therefore 

find that we have a completely consistent picture if we set e= (i=l-2cp and 
this moreover yields the effective refractive index we require for electro- 

magnetic ^^aves, through the relation 

n = ^/7^^ . (6.2.10) 

The quantities e and play their usual roles for both static fields and 
radiation, and an atomic system is indeed shrunk and slowed down as it is 
lowered into a spherically symmetric gravitational field. A capacitor 
shrinks and its electric field is diluted so that & (l + (p) and we infer 
from the Fdtvos-Dicke experiments that a nucleus, held together hy forces we 
have not considered, shrinks in the same way. 

In our original attempts to calculate the deflection of light in a gravita¬ 
tional field we followed two approaches which gave the same results. We 
first adopted as an axiom the principle that light travels in straight lines 
in a freely falling reference frame. Secondly we evaluated the deflection 
hy calculating the change of momentum due to a Newtonian force. These two 
approaches gave the same answer, wrong hy a factor of 2. We can now see 
that with the picture of gravitation we have developed light is deflected hy 
twice the Newtonian amount and yet still travels in straight lines across a 
freely falling hox. A system of measuring rods is distorted in a gravita¬ 
tional potential so as to remove that piece of the curvature of light that is 
not removed hy the gravitational acceleration. An observer located in such 
a reference frame has however no way of detecting this distortion. 
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In a distance Ax the gravitational deflection in the tensor theory is 

2GM 


Aa = 


2 2 

r c 


Ax . 


( 6 . 2 . 11 ) 


Half of this disappears if the reference frame is falling freely. 


Now the deflection (6.2.11) is measured with respect to an undistorted refer¬ 
ence frame a large distance from the source of the gravitational potential. 
Consider a box which when surveyed in this Olympian reference frame is cubical. 
Take the same box and immerse it in the gravitational potential. From the 
point of view of the Olympian observer, the bottom is shrunk more than the top, 
but the local observer in the box has no way of telling that his certified 
rectangular frame is bent like a banana. He counts the same number of atomic 
diameters at the top as at the bottom, and he finds light takes the same time 
(as measured by a local clock) to cross the bottom as to cross the top. If 
the box is of side AL in the Olympian frame then the top is at a potential 
CD and is of side 

AL (l 


and the bottom is of side 


r 


AL 




AZ. (U I 2 ) 


ALd.^B ) 
c2 

Fig. 6 .2.1 The distortion of a nominally rectangular box 
by the differential shrinking of top and bottom in a 
gravitational field 


L 


Ax =Ay =A^(1*^) 


The angle between the walls is thus 

Acp 


— AL 7 -- - 5 - Ax 

Ay ^ 


d cp 


GM 

2 2 

r c 


Ax 


c- c 

If light is launched at 90° to one wall in a freely falling frame it impinges 
on the other wall at 90°. The local observer thus claims that it has trav¬ 
elled through his reference system in a straight line. The Olympian observer 
agrees, and constructs the gravitational deflection of light to be observed at 
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large distances from the acceleration of the observer plus the distortion of 
the local frame. 

If the local frame is not freely falling, but is at rest, the local observer 
see a deflection 

GM 

Aa = Ax (6.2.12) 

r c 

and vill observe a reading GM/ r^ on an accelerometer. These observations 
are in accord with the observations that would be made in a space vehicle with 
the same accelerometer reading. It is thus the distortion of measuring rods 
by a gravitational potential that has introduced a new feature and has allowed 
locally equivalent physics in a gravitational field and a space vehicle and 
yet twice the originally expected deflection of light. The flat space coordi¬ 
nate system we have been working in is perfectly well defined, but is unobser¬ 
vable in the context of present day physics: we would need something like the 
hyperwaves of science fiction to detect by local measurements the distortion 
of standard rods and clocks. 


We may examine what would happen with either the scalar theory or the Newtonian 


theory by writing for a spherical field 

^ GM 


h = 
IJV 


0 

0 

GM 


0 

m 

r 

0 

0 


0 

0 

m 

r 

0 


r 


(6.2.13) 


cp = - V 

where a = +1 for the tensor theory, a =-l for the scalar theory and a=0 
for the Newtonian theory. Then Eqs. (5.4.5) become 


dp 

— = - m (1 + a^v^) Vcp - aa mvy . _Vcp 
dE 

— = am V . V cp 


In order to 


satisfy the relation 



£• 


dt 


(6.2.14) 


we must set a = - 1 


when we have 





THE DISTORTION OF REFERENCE FRAMES 


77 


— = - m(l + av) 7cp + amvv.Vcp 
dE 

— = - m V. Vcp . 

In a scalar theory there is no deflection of light because a =-l. We may 
therefore set p = Ev (i.e, £ = nEv with n=l) and find no acceleration 

as V c . 


In the scalar theory we therefore have 

E® = E(l+cp) 

p® = P (1 -acp) = P (1 +cp) 

c = c . 
cp 

The mass therefore goes also as (1 + cp), and for an atom 
r -- 2 , E , 0 ) - m e . 

Since light is not deflected the effective dielectric constant is plausibly 1 
and so r^ ^ (1 - cp) and E , uu(1 + cp) . That is, lengths are expanded and 
clocks are slowed down. We clearly get the same gravitational redshift. 


Light again goes straight across a freely falling box, as seen by an observer 
in the box, because although from the point of view of an Olympian observer 
light is not deflected, the bottom of the box is expanded more than the top, 
inducing the opposite effect from that obtaining with the tensor field. 


In the Newtonian case, we have a=0. We require n = 1 - cp and set 


P =nEv =(l-cp) Ev . 

Then 

dv 

— = - 7cp +2vv.7cp 

and as v -♦ 1 

^ 2 

which is the correct deceleration. 


We then identify 

E^ 

cp 

N 

Pep 

N 


E(1 + cp) as usual 

(l-acp) nEv = (l - cp) £ and m^ 
c(l + cp) 


(l-cp) 
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With e =(l-cp) and m = m(l-cp) the Bohr radius r^ is unchanged. Once 
more E , O) me*^ ^ me ^ 1 + cp. In the Newtonian theory (an equal mixture 

of tensor and scalar) there is no distortion of measuring rods and the elemen¬ 
tary reasoning of Chapter 1 survives. Clocks, of course, are still affected. 


6.3 The weight of an atom 

We have been working with the fields and responses of point masses in 
the last few chapters and have found that the gravitational field couples to 
the energy-momentum tensor, which for a particle integrates to 


IJiv 


= E —c 


dx. 


dt dt 


In both the scalar and tensor cases the force is a function of velocity: in 
particular, a particle moving at right angles to a field - 7cp experiences a 
force 

-m(l+v^)7cp tensor case 


-m (1 - v^) 7cp 


scalar case . 


We have tacitly been supposing that a composite bound system like an atom 
behaves in the same way as a particle: indeed it is clear that this must be 
the case from the Ebtvbs-Dicke experiments. In particular we saw in Chapter 2 
that the gravitational and inertial masses associated with the kinetic energy 
of the nucleons in a nucleus must be equal to within 1 part in 10^° . The 
typical momentum of nucleons in a nucleus is given by pr ~ h, with 
r 10"^^ cm . The velocity of such a nucleon is ^10® -10^° cm s“^ . The 
relativistic correction to the gravitational force, acting on such a nucleon, 
clearly does not manifest itself when the nucleon is bound: something can¬ 
cels these terms. 


We have a hint as to what this could be. Considering only free particles, 
we postulated that the gravitational field couples to the conserved energy- 
momentum tensor. Now the energy-momentum tensor of a little piece of a boiind 
system (like a nucleon in a nucleus) is not conserved. Rather than write 



3: 


liV 


= 0 


we must write 


where 3^^ is the force density, 

K such that 
MV 


dx 


J = 3^ 
MV V 


(6.3,1) 


The conserved tensor will be some construct 
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-sl-K -3=0 

and if gravitation is to couple to a conserved quantity, it is to 
it must couple. 


(6.3.2) 


K that 
gv 


For a nucleon in a nucleus, the kinetic energy is not conserved, but the sum 
of kinetic and potential energy is conserved: it is to this that gravitation 
couples. What is the analogy for momentum? 


Consider the nucleons in a nucleus as a gas confined within a box. For a 
single particle we have Eq, (5.4.4) 

-rr^ = mv v h . -mv.v h^_ . 

dt V dx gv g pv 

P g 

For dp/dt we extract 

” dp 

- m(l + v^) V cp + m V V . V cp 


where the term in -m Vcp comes from h^ and the other terms come from 

h fi = 1,2,3). The sum of mv'.v. over all nucleons is clearly not equal 
to zero, but reflects the mean square velocity. However, the sum over all 
nucleons of terms like mv.v. denotes the rate at which momentum is incident 
in a given direction. At the walls of the nuclear box this momentum is 
turned around by the presence of stress in the walls (external forces as far 
as nucleons confined in a box are concerned). On integrating over the whole 
nucleus the sum of the pressure exerted by the nucleons and the stresses con¬ 
fining them must be zero for a bound system. For a system at rest the parti¬ 
cle part of is just the pressure: gravity couples to both this and the 

stresses, and on integrating to get , the two parts cancel for a bound 

system. 


We can put the whole thing in a picturesque form by considering the equation 

dp 

= - m(l+ av2) Vcp + F (6.3.3) 

where a = +1 for the tensor theory, and -1 for the scalar theory for a 

particle in a gravitational field. The force F represents the ’spring 

balance’ from which it is hung: in equilibrium 

F = m(l + av^) Vcp . 

Confine a gas of such particles in a box of volume V . Let there be n per 

unit volume. Then F = nm V(l + Vcp (6.3.4) 
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if ve take no account of stress. The term nmv"^ gives the pressure exerted 
on the \valls nmv^ = 3P . 


If the box is lowered an infinitesimal distance into the potential its size 
changes (from the point of view of the 0Iyii5)ian reference frame) and conse¬ 
quently 

VV(1 + 3a Acp) or AV=3VaAcp 


since L 


Thus 


L(l + acp) in a field cp . Then 
P AV = nmv^ aV Acp 

= nmv^ aV_Vcp, Ax . 

F, Ax =|nmV7cp. Ax+P AV j 


(6.3.5) 


represents the work done in moving an infinitesimal distance x , It is only 
the first term on the right hand side that corresponds to weight, for the 
second term is cancelled by the stresses that hold the complex system together. 


By making measurements in a laboratory within which the gravitational poten¬ 
tial does not vary significantly, we cannot distinguish between the scalar 
and tensor theories, or indeed any combination of them. It is only by making 
measurements of relativistic corrections, over a region of space in which the 
gravitational potential changes substantially, that we can pick out the 
correct generalisation of Newton’s theory of gravitation. 


6.4 Covariant equations of motion 

In a spherically symmetric gravitational field we have inferred that 

the equation of motion of a particle is 

dv / 2\ 

- (l+v ) vcp+ 4V V. 7cp. 

We would like to find the covariant equations corresponding to this. A 
crucial ingredient was our recognition of the slowing down of light which, 
again in a spherically symmetric field, we were able to express by setting 
c — where n = 1-2 cp . 


In a more general case, it is best to start from the manifestly covariant equa¬ 
tions 


dT 


dx 

O dT 


dx 




dT ^bx [XV 


3x^ “pvij (5.4.7a)(6.4.l) 


which we derived in section 5.4 as a result of setting 




p = Ev 


and 
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together with E ^ = m (relations we have already been forced to modify). 

Qt O 


In relativistic mechanics the momentum-energy four-vector can be written 

E = m 


Pp = “o -df 
dx. 

p. = m_ 


- dx dx 
dt 


b dT 


1 o dT 

With these identifications we may write 

dx^ dx^ 

2 P P 2 

p p = m = - m 

* jQ"*" p ^ ^ 

and 

dT 

“ dt dT at ± 
the standard results. Maintain these identifications and write 


o dT dT 
dx. dx. 


E = E =Ev. , 

dt dT dt •• 


d^x dx dx ^ Sh ^ 

_ 9 ^ _v ^ _ _B^\ 

dT dT I- Sx 3 dc J 

H H’ 


(6.4,2) 


This is a set of four equations from which we can extract the acceleration ; 


Now 


d^x. 


dT^ 


d^x. 

dx 

dx , 

. 3h 

Sh. . 

_1 



_yv 

_1V\ 

dT^ 

dT 

dT ' 

L "bx. 

1 

J 

d^x. 

dx 

dx . 

r ^ 


4 



_yv 

_ 4v^ 

dT^ = 

dT 

dT ^ 

1 

Bx J 

. dx. 
_L 


d 


d^x. 

1 

dT 1 dt 

dT J 

d 

t 2 dt 

" dt^ 


With X. = it we obtain 
4 


d2x. 


dt'^ 


which can be written at once as 
d^x. 


dx 

dx 

, ah 

ah. , 


_V 

!__tw _ 

IV ] 

dT 

dT 

1 dx. 

1 


dx 

dx 

f ^ ^ 



_V 

i “ 

4v\ 

dT 

dT 




dx 

_y 

dx , 

r dh^^ 


■ ^uv 

dh. -n 

4v 1 


“ dt 

dt ^ 

L dx. ” 
1 


- 3*4 “ 

dx 1 

y 

dt J 


. (6.4.3) 


Specialise to the time independent spherically symmetric field and find 
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Tvhich is of course the answer we obtained setting £ = Ev , Eq. (5.4.9), which 
we found to be unsatisfactory. ¥e must modify the kinematic equations in a 
covariant way. 


Let us keep the relation 


and set 


p p = - m 

dx 

p = m X . 

o pa dT 

We see from Eqs. (6.4.2) and (6.4.4) that we need to obtain equations which 
look something like 


which is equivalent to 


d^Xp 

dx,, 

dx 


dT^ 

_B 

_V, 


dT 

dT 

L "dx 

P 


dx 

dx , 

r dh 

_ B 

_B 

— 


dT^ 

dT 

dT 1 



Sh 


- 2 


Vl 


Sx 


1 




^h. 


IJl V 

Then on differentiating the momentum with respect to T , we have 




dx 


dX 


_. ^ V _y _y _ 

dT O P(J dT O 




dT 


d^x 


dx dx dX^- 

„ V C7 a p a 

m K - T-" + m ' , ■' —z — 

® P^ dT^ o dT dT ^ 


(6.4;5) 


Identify = 6^^ 


+ h and obtain 

pa 


(6 +__ 

^ pa p 




dT^ 


dx,, dx ^ 3h 

_y_V I_ 

dT dT I Sx 

p 


Sh 


( 6 - 4 - 6 ) 


This is a manifestly covariant equation, and differs from our desired form only 

in the coefficient of the quantity d^x / dT^ . To first order in h we 

a lav 

clearly obtain the equation of motion we are looking for. The correction term 
on the left hand side corresponds to an acceleration dependent four-force, 
determined by the gravitational potential. 


We are now happy that the equation of motion (5.4,14) in a spherically sym¬ 
metric potential characterised by cp is, to first order in cp, obtainable from 
a respectable general equation. Let us investigate the implications for the 
energy and momentum. We have 

2 

p p = — m 
^P^P o 

dx 

^p ” °^o ^pCX dT 
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These are manifestly covariant equations; we may write 

dx 

0 

^ /I ^ • 

O UT 




In the relatively simple case of diagonal h^^ we have 

dx. 

p. = m (1 + h. .) 

o^ ii' dT 


m (1 + h.. 
o ^ 44' dT 


Then 


-1 dx 

p = (l+h..)(l+h44)-lE^ 


and if we have p^-E^ = -m^ , then 
^ o ’ 


E = 


Jl- (l+h^^)^(l+h^) 2 v^’ 


which again agrees with our previous identifications. A related, hut much 

more complicated equation would obtain for general h . 

H’V 


It is worth noting the implication of the relations 

v,2 


dx 


up = - m 

o 


p = m X . 

O PCT dT 


These can only he true, with m^ an Olympian frame invariant, if 


X X_ ^ dx 

pa a pa a 


dT^ 


(6.4.7) 


We recover the standard kinematic relations of special relativity on effecting 
a redefinition of coordinates 


X dx dx 

pa a p 

which matches with our earlier identifications ^ op ^ ^cp 

for the time independent spherically symmetric field; a length L ^ is mea¬ 
sured in the Olympian frame and is ^ = (l + Cp)L where L is measured very 

far away from the source of cp . The quantity 

dx. = X: 

1 ^a a 

is X . L = (1 -(p)L = L (to first order in cp) . Write Eq, (6.4.7) in the 

IQCp Cp 

form 

(l+h..)2 <^2 _ (l+h^)2dt2 =_ dT2 . 

If (^t)^ = 0 as it is for light in standard special relativity, then 
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1^1 = ^ 

(1 + 2cp) 

for the tensor theory. More generally 

\ 2 _ / /I n-\2 




So that 


At = 


At 


^(1+1144)^ ■ (1 ^2 




^/l - (l + h..)2 (l + h^J-2 v2 

'' 11^ ^ 44^ 


(6.4.8) 


•which is the usual time dilation result modified by the factor (l+h^)“^. 


¥e may also note that with dj^ = 0 

(l+hii)2 dx^ + (l+h^g)^ <ly2 + + dz^ = (l + h^)2 dt^ 

for the case of a diagonal h^^ but not necessarily a spherically symmetric 
field. The velocity of light in the x direction is not necessarily equal to 
the velocity of light in the y direction; it is clear that a non-zero h^^ 
affects lengths in x and so on. Indeed we can see that the scalar, Newtonian 
and tensor theories slow clocks down in the same way (h^^ = cp) while the 
Ne-wtonian theory leaves lengths unchanged = O) • The tensor field shrinks 

lengths 0^^^ =“ cp) while the scalar field expands them = Cp) • 


We may now proceed one step further. If we write 

dx 

p =(6 + 2 h )m 

^Cpp ^ po po^ o dT 


and 




dT 


dx dx i dh 

U _V \ ^iav 


o dT dT ^ 


we obtain for the acceleration equations in free-fall 


(6,4,9) 


(6,4.10) 




d^x dx dx (■ dh 
_g _ u _V [ uv 

dT^ "" dT dT ^ 



(6.4.11) 


which to first order in h^^ give us once more the equation of motion in a 
spherically symmetric field. The advantage of this identification is that 
in the absence of external forces other than gravitational the quantity 
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is conserved so long as vanishes and 


dx 

p =(S. + 2 h. )in —^ 

^10 la' o dT 


(6.4.13) 


is conserved so long as 9h^^/ vanishes. It is these quantites that 

behave like momentum and energy when external forces (for example electro¬ 
magnetic forces) are applied in the presence of a gravitational potential. 
For the case of diagonal h 

Liv 

(1 +h.lvl) 

44^ 


liV 


E = 


'p Vi - (i + h,')2 (i + h^)- 


44/ 

-1 


2 ^2 


Vl - (1+h )2 (l+h^)"2 


(6.4.14) 

(6.4.15) 


Our calculations in Chapter 5 and Chapter 6.1 and 6.2, augmented by the consi¬ 
derations of this section, demonstrate the machinery by which the gravitational 
field inserts the extra factors not contained in special relativity. The 
changes of scale implied by (6.4.7) must not be added to those calculated with 
modified expressions for mass and energy, for they are a different statement 
of the same physical effects. 


6.5 The Lagrangian formalism; again for experts 

The manifestly covariant equation (6.4.11) embodies all the results of 
Chapter 5 and Chapter 6, sections 1 and 2 in a more general form. This equa¬ 
tion may be derived by writing the invariant Lagrangian for a particle in an 
external gravitational field as 


L 


o dT dT 


+ m 


dx 

—1:^ 

dT 


dx 
_V 

dT 


liV 


(6.5.1) 


and feeding it into the covariant Euler-Lagrange equation (3.3.3); the accel¬ 
eration dependent forces emerge quickly and naturally in this formulation, 
which is based on the identification of the interaction 


h 3 
P-V pv 

with an invariant Lagrangian density. 


The modifications to the equations of electrodynamics necessitated by the 
presence of gravitation can also be worked out by augmenting the Lagrangian 
function for the free fields plus sources [l] with an interaction term, the 
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gravitational potential being coupled to the energy-momentum tensor of 

the electromagnetic field [2,3]. This approach both generalises our results 
to the general tensor field and automatically provides covariant equations of 
electrodynamics in the presence of gravitation. The results are valid for 
Tveak gravitational fields but can be extended to strong fields [2], yielding 
in the end Einstein’s theory. 
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CHAPTER 7 

THE PRECESSION OF THE PERIHELION OF MERCURY 


7.1 Introduction 

In the previous chapter have developed a picture of gravity which 
satisfies the uniqueness of acceleration in a gravitational field and yields 
the gravitational redshift, deflection of light by the Sun and radar echo 
delay in accord with the predictions of the general theory of relativity and 
experiment. We have also seen how such a theory can contain a principle of 
equivalence of physics in all local freely falling frames. We have thus 
dealt with three of the four famous tests of general relativity (or indeed any 
theory of gravitation). The fourth of these tests is the precession of the 
perihelion of Mercury. This phenomenon is of particular importance because 
it is the only currently available test of the nonlinear terms in this formu¬ 
lation of the theory of gravitation. 


A planet in orbit in a perfect inverse 
square field follows, in Newtonian mecha¬ 
nics, a path that is an ellipse, with 
axes fixed with respect to the absolute 
space of Newtonian mechanics, operation¬ 
ally defined as the reference frame pro¬ 
vided by the very distant stars. If the 
gravitational field in which the planet 
moves is not perfect inverse square, 
this is no longer true and, among other 
effects, the axes of the ellipse slowly 
rotate with respect to the distant stars. 
The perihelion of an orbit is the point 
of closest approach to the Sun and if 
the ellipse is rotating, the perihelion 
is slowly rotating with respect to the 
distant stars. (in the solar system the 
principal cause of such rotation is plan¬ 
etary perturbation . ) The easiest way 

of seeing what is going on is to con¬ 
sider circular motion about the Sun (or 


Precession angle 



Fig.7.1.1 The trajectory of a 
test particle in a precessing 
orbit (of extreme eccentricity) 
is shown for 2^ revolutions. 
The precession is greatly 
exaggerated 
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any other primary) and superimpose on this circular motion a small radial 
oscillation. If the periods for these two superimposed motions coincide, the 
planet returns to perihelion after sweeping out 2tt radians. In this very 
special case the orbit is closed and repeats every revolution. If the periods 
are not the same, but are commensurate, the orbit repeats after a certain num- 
ber of revolutions, while if the two periods are not commensurate, the orbit 
never repeats. 


If the period of the radial vibrations is Tj^ and the period for a rotation 
of 2tt is Tq , then the planet sweeps out 2Tr (Tj^)/Tg radians in time Tj^ . 
The angular advance of the perihelion each revolution is thus 


(see Fig.7.1.1). 


2 TT 



1 ^ radians 
J 


7.2 Perihelion advance in Newtonian mechanics 

We will first demonstrate that in Newtonian mechanics there is no pre¬ 
cession of the perihelion for motion in a pure l/r potential. 


The gravitation potential is 



and the equation of motion is 


dv 

dt 



(7.2.1) 


The orbit is planar and resolving (7.2.1) into radial and tangential components 
we have 


dfr 

dt^ 


r 



GM 


(7.2.2) 


dfe ^ de ^ 

dt2 dt dt - 


0 


or 


We can write Eq. (7.2,4) as 


i ^ 

r dt \ dt / "■ 


r 


2 ^ 
dt 


= h 


0 . 


(7.2.3) 

(7.2.4) 

(7.2.5) 


where h is a constant and is clearly the angular momentum per unit mass, 
since the angular velocity is d0/dt. Eqs.(7.2.3) ,(7.2.4) thus corres¬ 
pond to conservation of angular momentum. 
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this point represent a nearly circular orbit 


Since h is a constant of the motion -we can decouple the angular and radial 
equations by lo’iting (7,2*2) in the form 

o _ 

(7.2.6) 


dt 


2 “ 


m 

^2 


corresponding to one dimensional motion in the potential 




effective 


2r^ 


m 

r 


(7.2.7) 


Illustrated in Fig.7.2.1. The first term in Eq. (7.2.7) is a centrifugal 
repulsion of exactly the kind encountered in quantum mechanics problems. 


d^r 

dt^ 


R 


= 0 


For circular motion 
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and so 


GM 


(7.2.8) 


b2 

where H is the radius of the circular orbit. Introduce a radial oscillation 
by setting 

r = R + p 

when Eq. (7.2.6) becomes 


d^p 

h^ 

GM 

dt^ 

(e+p)3 

(E+p)2 

Expanding in powers of /[/R 



d2p 

f 3h2 

2GM'l . 

II 

ICM 

-P 


—T"! P 

and substituting for h^ from Eq 

. (7.2.8) 


d2p 

GM 


II 

loj 

k 

- P • 

R^ 



Then 


p = Po V ^ * 

and the period of small radial oscillations is 


2 IT 


/ Sl 

V GM • 


The tangential velocity is 


de _ h ^ 


(7.2.9) 


(7.2.10) 


dt r V r V B 
and the distance round the orbit is approximately 2rrR , so the rotational 
period is 


'0 


2tt /SI 
V GM 


(7.2.11) 


which is identical with the radial period. There is no advance of the peri¬ 
helion. 


It should be clear that this is a special case. Suppose that the l/r poten¬ 
tial is perturbed by a (central) potential with different radial dependence : 



m 

r 

cp = - 

r 


GM J 

(n + l)a 

R^ 


1 1 + - — 

R^ 


Then for circular motion 
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and the radial equation is 


and 



(with n=0 the two periods are the same, of course). The advance of the 
perihelion is 



( 7 . 2 . 12 ) 


The only example of this kind of simple perturbation that is of inter¬ 
est is for n=2, corresponding to a quadrupole field in the equatorial plane. 

7 3 The relativistic theory and the need for nonlinear terms 
In Chapter 5 we found an equation of motion 


(1 + v^) 7 cp + 4y V . V cp (5.4.14) (7.3.1) 


which reduces to (7.2.1) as v-0 and gives the correct deflection of light 
by the Sun. This equation was derived working only to first order in the 
potential. As it stands it clearly contains departures from Newtonian mecha- 
nics of order v^ (with c = l) and we could at once apply the methods of the 
previous paragraph to find the precession rate of a planetary orbit due to 
these terms. 

However, for planetary motion 



and so the dimensionless quantities and GM/rc^ are of the same order 

of magnitude. If we are to work out the effects of relativistic gravitation 
on planetary orbits, we must include corrections ~cpVcp along with those 
~v2 Vcp. We have already seen that the equations of the gravitational field 
are expected to he nonlinear, giving rise to terms ~ cpV cp in any equation of 
motion. We shall therefore study the advance of the perihelion obtaining 


with an equation of motion: 


Ov 

dt ^ 


= - (l+v^)^cp- aVcp? + 4v V. Vcp 


(7.3.2) 
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where cp has been replaced by cp(l +acp) in (7,3,1) and terms in v^cp^ , 
being O(cp^) , have been dropped. Our first task is to discover the appro¬ 
priate value of the coefficient oc. Some of this coefficient comes from the 
nonlinearity of the field equations, but we should note that even neglecting 
this nonlinearity we find a= 1 from the most respectable looking formula¬ 
tion of the free-fall equations in the tensor theory, Eq. (6.4.11). (This 
result is easily obtained on applying to (6.4.11) the procedures used to ex¬ 
tract (6.4.4) from (6.4.2) and keeping all terms up to second order in small 
quantities.) 

7.4 Strong equivalence and the nonlinear terms 

We are not going to attempt to develop a consistent nonlinear theory to 
even second order in the potential cp. We shall cut this Gordian knot, 
instead of unravelling it, by applying a form of the principle of strong equi¬ 
valence. 

We saw in Chapter 2 that inertial and gravitational mass are strictly propor¬ 
tional to very high accuracy for energy contained in electromagnetic and strong 
interaction fields. The Eotvos-Dicke experiments have little to say about the 
weak interactions and nothing to say about the inertial and gravitational mas¬ 
ses of gravitational fields themselves. A principle of equivalence states that 
in any local freely falling frame the same Lorentz covariant laws of physics 
apply, with the same numerical content. Such principles represent an abstrac¬ 
tion from special relativity plus the results of the Eotvos-Dicke experiments 
and so are reasonably sound for the laws of electromagnetism and strong inter¬ 
action physics but represent an assumption for the laws of gravity. The prin¬ 
ciple of strong equivalence assumes that all laws of physics, including gravity 
itself, are the same in any local freely falling frame. It was this postulate 
that Einstein used to arrive at the field equations of general relativity: we 
shall be less ambitious and use it to find the value of a in Eq.(7.3.2), assum¬ 
ing that a completely consistent tensor theory would contain the principle of 
strong equivalence (as is in fact the case). 

If the principle of strong equivalence holds, then on gently lowering a small 
solar system into a gravitational potential on a much larger scale, say a 
galactic potential, the characteristic dimensions and frequencies of the system 
must remain the same, measured locally. In the Olympian reference system atoms 
shrink and vibrate more slowly, light is slowed down and so the small solar 
system must shrink and slow down in proportion. As we lower our little solar 
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system into the galactic potential $ mb expect its linear dimensions to change 
as 1+ $ and its characteristic times to change as 1 - $ . Velocities "will 
thus vary as 1 + 2 $ , and the quantity 


Nov 


— 1 + 3 $ 

r 

iL ^ s. 

r r 


vhere cp is the internal potential of the little solar system and so ve must 

cp ^ 1 + 4 $ 
cp - cp(l + 4 $) . 


have 
that is 


The same result may be obtained by noting that the energy of a body of mass m 
in the potential cp is -mcp and that m varies as 1-3$, while energy must 
vary as 1+ $ for the principle of strong equivalence to hold. 


Now the quantity 2cp$ is the cross product tenn in (cp+ so consideration 

of the properties of a small gravitational system in a large scale potential 
suggests the general replacement 

Y - ^ + 2 

as the second order correction to a potential Y in Eq, (7.3.1). 


This makes sense. Consider two masses m in a field Y, separated by a poten¬ 
tial difference Their energy difference is mAY , Compare this with the 

difference between atomic energy levels at Y . This latter difference changes 
with Y as 1+ i and so the principle of strong equivalence is satisfied if 
mA^ changes as 1 +Y. Since m varies as 1-3 Y , A^ varies as 1 + 4Y . 

That is, 

ay- AY(1+4Y) = A(Y+2Y2) . 


In order to compute planetary motion to second order in small quantities, under 
the assumption of strong equivalence, we must replace the potential cp(=- —) 
by the quantity cp+2cp^ in equation (7.3,1); that is, set a= 2 in Eq.(7.3.2). 


7.5 Calculation of the advance of perihelion ^ 

We take as our equation of motion 

- (1 +v^) Vcp - aVq:? + 4v v . Vcp (7.3.2) (7.5.1) 

and begin by resolving into components. 

The angular equation is 

r dt v dt / \ dt / dt 3r 


(7.5.2) 
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It is important to notice that this equation, "which corresponds to conserva¬ 
tion of angular momentum, differs from the Newtonian equivalent, in that the 
right hand side is non-zero. 


Since the potential cp is not explicitly time dependent, we may write (7,5.2) 
as 


Set 


i = 

r dt V dt / 


4r 


d0\ ^ 
dt/ dt • 


(7.5.3) 


X r 


2 ^ 
dt 


h 


(7.5.4) 


where h is 
factor X . 


a constant and obtain an equation for the relativistic correction 


Differentiate (7.5,4) with respect to time 


_d_ 

dt 




aes 2 <30 cix 

at/ ^ at at “ ° • 


(7.5.5) 


Equations (7.5.3) ana (7.5.5) yieia 


or 


2 d0 dx 
^ dt dt 


+ X 



dx 

dt 



Since we need to work only to one order in small quantities higher than the 
Newtonian limit, it is sufficiently accurate to set 


(x must reduce to 1 


X = 1-4 (p 

(p-> 0 ) and the equation for angular motion becomes 



h{l- 



(7.5.6) 


Since we earlier made the identification 

(1- 3cp)in^ 

m = . . — 

Vl - (l-2cp)2v2 

(see Eq, (5.5.5)) it might be thought that the quantity 


would be conserved. 

Since 


(1-3CP) 

This is not the case. The angular momentum J is 

J = mvXr^ . 
d£ 

X dt =zxx = 0 , 

^ ^ eta ( ^ \ 

Oi = M + “^at • 


Equations (7.3.1) and (7.5.1) yield: 
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dv 


-^Xr = 4 (vXr) v. Vcp 


so 


dt 

6J 


(vXr){^ + 4m V, 7cp}. 


To first order in small quantities, 

m 

and so 


dv 


■^ = m ]v,-f-3v. Vcp) . 
dt o dt — “ ^ J 

The leading term in the expressions for ^/dt is - first 

order in cp, <y./dt is indeed zero, M±th vxr given by Eq. (7.5.6). 


The 


radial equation of motion obtained from Eq. (7.5.1) is 


$-(f/ -#] *‘(t) P • <’•-) 

For a circular orbit both dr/dt and d^r/dt^ are zero. For small radial 
oscillations superimposed on such a circular orbit, the terms in (dr/dt) 
will be very small. They are also anharmonic, do not affect the basic period 
and will cancel out over a few revolutions. We therefore drop them and obtain 

('de\2 J, ( d0y\ GM 2a/GM'\2 (7 r, n\ 

-Hdt) dti J 72+— 


d^r 

dt^ 


We now use Eq. (7.5.6) to substitute for dB/dt . Remembering that 

b2 .. B 

and that we need only keep terms up to 4(¥r ( becomes 


d^r 

dt^ 


. , i!) M . fXf)\ 4 (1 - « • (’•»■») 


We nov set r = R + p , vith R given by 


and obtain to order \~j 


d^r 

dt^ 


£!£ _ M -ii 9tM '! n 

dt2 = " r3 ■ B J P 


dr 

° ^ 


and hence a vibrational period 


^B = 


(7.5.10) 
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The terms in 
(7.5.9) with 


a disappear on substituting the value of 
d^r/dt^ = 0, which is 




(9-2a)f }. 


h 


obtained from Eq, 


(7.5.11) 


The orbital velocity is given by Eq. (7.5.6) 



and so the angular period is 



(7.5.12) 


The advance of the perihelion each revolution is thus 

T 

Ij = 2TT(5-a)^ (7.5.13) 

@ 

radians. 


With a= 2 , the value we inferred by applying the principle of strong equiva¬ 
lence, we obtain 6 tt (GM)/H radians per revolution, which is Einstein's 
result. With a= 1 , yielded by the tensor theory without any effects of 
self-interaction, we obtain 8 tt (GM)/E radians per revolution, 4/3 of the 
result from general relativity. 


7.6 The precession of the perihelion of Mercury 

The planet lying closest to the Sun is Mercury. The orbit of Mercury 
is thus most sensitive to the relativistic corrections we have been discussing, 
and in addition has the advantage of high eccentricity, e=: 0.206. This eccen¬ 
tricity does not affect the rate of advance of perihelion, to the accuracy of 
Eq. (7.5.13), but it does make the measurement of this advance easier. It has 
been known for over a century that after the effects of planetary perturbations 
have been removed, the perihelion of Mercury retains an anomalous advance of 
43" (seconds of arc) per century. 


Our expression for the advance per revolution is, after inserting explicitly a 
factor c^ which had previously been set equal to 1 for convenience 


GM^ 


6 TT 


m 


where is the appropriate radius parameter for the orbit of Mercury. 


The orbit of Mercury has a semi-major axis a= 0.3871 a.u. where la.u. 
(astronomical unit) is 1.496 X 10^^cm . The eccentricity of the orbit is 
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e = 0.206 

and so the semi-minor axis /-r* 

h = a VI -e^ 

is 2^ shorter than the semi-major axis. Onr simple treatment in terms of 
small radial vibrations does not tell ns directly ■what quantity to take for 
R^ . However, we note that the advance of the perihelion is given by 

— 6 TT pi 

where cpj^ is the potential at R . The elliptical orbit means that a small 
range of potentials is sampled in the course of the orbit and so we shall re¬ 
place cpj^ by (cp) , the average value of the potential, and hence evaluate 


® /I \ 

6 tt — 2 - < 7 ) • 


The equation of an ellipse is 


1 


1 + e cos 0 


where 9 
where -t 
the orbit, 

fey 


is measured from the major axis, so averaging over 6 , ~ ^ 

is the parameter known as the semi-latus rectum (the distance across 
normal to the major axis, measured through the focus) and is given 

t = a (1 - e^) 


or equivalently by 


Our final expression for. the advance of the perihelion is thus 


1 _ 1 JL_'\ 

^ 2 \r_„^ 


max 


min 


6 TT 


% 


(7.6.1) 


which is the result obtained by a more complete treatment of Eq. (7.5.1) with 
a = 2, which can itself he derived from general relativity, [l] . 


With 


we obtain 


I = 0.555 X 10^^ cm 
X 10^^ 


= 1.99 


gm 


6 TT— % = 0.501 X 10-® 


/t c 


radians per revolution. The period of Mercury is 0.241 years and so the rate 
of advance of the perihelion of Mercury given by the expression (7.6.1) is 43" 
per century. 


The precession of the perihelion of Mercury is measured with respect to an 
Earth based frame of reference. This frame cannot be directly related to the 
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frame of the distant stars because the rotation axis of the Earth does not 
remain fixed in such a frame but precesses with a period of about 26,000 years 
(the precession of the equinoxes) due to tidal gravitational forces acting on 
the quadrupole moment of the Earth. The observed precession of the perihelion 
of Mercury 5600" ± 0.5" per century) is made up from a piece due to the 

precession of the equinoxes 5025" per century), a piece due to planetary 
perturbations 532" per century) and the residual anomalous precession which 
[2] is - 43" ± V' which is in agreement with general relativity (otherwise 
known as the massless self-interacting spin 2 theory). The advance of the 
perihelion has also been measured for other planets but the results are accu¬ 
rate to no better than 10^^ . 

The agreement between the calculated and observed values of the centennial 
advance of the perihelion of Mercury thus provides a verification at the 2P/o 
level of the equation of motion (7.5.1) with a =2, rather than a=l as 
yielded by the spin 2 theory before allowing for additional nonlinear terms 
due to selfcoupling, and is the only test of these nonlinear terms . This 
conclusion is however only warranted if nothing has been left out: in section 
7.7 we discuss the quadrupole moment of the Sun. 

The recent discovery of a pulsar in a binary system [3] offers the possibi¬ 
lity of the investigation of much stronger gravitational fields than are easily 
accessible in the solar system. Pulsar PSR1913+16 has a pulse period of 
59 ms and is a member of a binary system with a period of 7.75 hours. The 

orbit is highly eccentric. The pulsar is presumably a neutron star with a 
mass ^M^ and the masses of the two components cannot be too different. There 
is evidence that the companion is much more condensed than a main sequence 
star and is therefore a white dwarf, a neutron star or possibly even a black 
hole. 

The -quantity 

^ « 10-® 

Rc^ 

thus making in principle the relativistic Doppler shift and gravitational red- 
shift variations in pulse frequency easily observable, and giving a periastron 
advance of several degrees a year. There is also the possibility of observ¬ 
ing the effects of spin-orbit coupling. 

•k 

It is however possible to construct a theory of gravity without self¬ 
interaction which yields the correct value for the advance of the peri¬ 
helion of Mercury LS. Deser and B. Laurent, Ann. Phys., 76 (1968)] 



THE PRECESSION OF THE PERIHELION OF MERCURY 


99 


7.7 The oblateness of the Sun 

In section 7.2 we noted that a precession of the perihelion would he 
induced, in the framework of Newtonian mechanics, by a perturbing potential 
varying with r as , with n non-zero. Such terms would be intro¬ 

duced if the Sun were not perfectly spherically symmetric, the only term of 
importance being the potential due to a quadrupole moment, varying with r as 
r~^ , A quadrupole moment would be reflected in the figure of the Sun: the 
disc of the Sun would he slightly oblate (flattened at the poles). Some 
oblateness is expected anyway because of the Sun’s rotation. Interest in 
this possibility was reawakened some ten years ago as a result of measurements 
by Dicke and Goldenberg which implied a quadrupole moment many times greater 
than expected on the basis of a uniformly rotating Sun and sufficient to con¬ 
tribute some 3" per century to the advance of the perihelion of Mercury [4]. 


In Newtonian theory the gravitational potential of a body of local mass density 


p is 


cp(£) = - ^ I 


p(r') d^r' 


(7.7.1) 


Expanding as far as 
body 



the quadrupole term yields for a rotationally symmetric 

- ^ (fcos^e- I) Jp(rO (|cos2@-i) r'2d3r' (7.7.2) 


where 0 is the polar angle of the vector r and © the polar angle of the 
vector r^ , both measured from the axis of symmetry of the quadrupole. It is 
convenient to write this as 

cp(r) =-f{l-P2(cose)Q(f)'} (7.7.3) 

where Q is a quadrupole moment parameter defined by 


Q = J p(r')(| cos2@-i) r'^d^r' . 

Mr 

o 

For an ellipsoid of uniform density the integral has a value 

I M(b2-a2) 

V 2 A 

and if a = b+ A (for an oblate ellipsoid), Q = 5 7 ^ • 

The Sun of course is not of uniform density, and the relation between the quad¬ 
rupole moment and the oblateness parameter A is more complicated. The effec¬ 
tive potential near the surface of the sun is the gravitational potential (7.7.3) 
plus a term giving the centrifugal force 
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Effective = ^ tI ^ - ^2 0) Q [-f) | - -i uJ^R^ (7.7.4) 

where E is a coordinate measured normal to the axis of rotation and uo is 
the angular velocity of the surface. To the extent to which the surface is 
maintained in equilibrium by pressure, the surface will follow the equipoten- 
tial surfaces of Eq. (7.7.4). 


Equating the values of 


cp^ff at the poles and on the equator gives 


Tq 2 ^ ^ 2 


0)2 


GM 


(7.7.5) 


In the case of a negligible quadrupole moment, the surface oblateness is deter¬ 
mined by the l/r field of the dense core and the centrifugal force and is 


A(Q = 0) 


0)2 


GM 

© 


(7.7.6) 


The surface of the Sun rotates in approximately 26 days so 


A(Q = 0) 

- 10 

r 

o 

and this corresponds to negligible quadrupole moment. The opposite extreme is 
represented by taking a uniformly rotating Sun of uniform density and setting 


2 _A 
5 r^ 


in Eq. (7.7.5). This yields an oblateness 


A (uniform) 5 

r = 4 "gmI” 

o © 


2.5 X 10“^ 


and a quadrupole moment Q = 10"^ . Any uniformly rotating model of the Sun 
will be much closer to the former case than the latter. In the equatorial 

plane, (cose)=-f so setting n= 2 and a = r^ in Eq. (7.2.12) we 
expect a centennial precession for the perihelion of Mercury « 1" from the 
solar quadrupole moment. 


Dicke and Goldenberg measured an oblateness of 4.5 X 10“^, corresponding to 
Q = 2.5 X 10“^ and a centennial perihelion advance of 3” for Mercury. Such 
an unexpectedly large quadrupole moment was attributed by Dicke to a rapidly 
rotating interior of the Sun, the outer layers being braked by interaction with 
the solar wind [ 5 ] . 
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However, a more recent measurement yielded an oblateness parameter in accord 
with that expected for negligible quadrupole moment [b]. The same work found 
evidence for periods of excess equatorial brightness to which the results of 
Dicke and Goldenberg may be attributed. The present position is that there 
is no evidence that any significant part of the advance of the perihelion of 
Mercury is due to a solar quadrupole moment. 
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CHAPTER 8 

GRAVITATIONAL WAVES 


8.1 Introduction 

The first step in the generalisation of the gravitational Poisson equa¬ 


tion 


yielded 


7^ h = 4 ttG p 


(4.1.8) 




T^hich at once led us to expect gravitational waves propagating with velocity 
c, solutions of the free field equation 

(,2 JL)^ 0 


□ Vv ■ ^ 

for a tensor field. This may he compared with the electromagnetic equation 

□ A = 0 . 

^ l-L 

In the electromagnetic case the source of is the current and we have 

(with c = l) 

□ a^= - 4ttP^ 

with the particular integral solution 

p p (r',t - Ir-r'l) 

A,(r , t) = I -j-n ’ 

« k-n1 

the familiar retarded potential. 


Our spin 2 tensor theory has 

rn h =-8TrG-[3 -Ls 3 1-= - 8 ttG 3 
[_) tiv " I liv ^ |J.v ooJ UV 

with the retarded potential 



( 8 . 1 . 1 ) 


which of course contains a radiative component with derivatives of the poten¬ 
tial falling off as l/r. 
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8.2 Transverse nature of the waves 
Because of the conservation law 


0 


we inferred for electromagnetism the Lorentz condition 

bA 


0 . 


For gravitation we have 




MV 




and so expect an analogous condition. But 

h, ~ i = {3 - i 6 3- I 

MV liv ^ ^Jiv ^ i-Lv a a J 

and so the analogous condition will NOT he 

3h 

_MV 




( 8 . 2 . 1 ) 


hut rather 


Bh 


MV 


dx. 


= 0 


( 8 . 2 . 2 ) 


where 


Since 


h = h -4-6 h 
MV MV ^ MV c 


II 

46 h 

MV 

MV 

MV era 

h JT - 

46 3 

MV 

MV 

^ MV aa 

h 

a: - 

i-6 „ 

oa 

aa 

^ act. P P 

-3 - 46 

3 - 4 6 

MV 

- MV 

aa ^ 


4 6 ? +46 6 

^ ijv oa MV aa pp 


= 3 : - 46 


MV 


46 {? +3 -46 

‘*Mv'-aa Ota ^otappj 


In these equations ci, a and p are dummy variables (summed over) and 6^=4 
(that is, + 6^2 + 6 ^^^ + 6 ^) so the curly bracket sums to zero and we have 


h = h 
MV 


46 h ~ ir 

MV Ota MV 


( 8 . 2 . 3 ) 


and hence infer 


Sh 


MV 


Bx. 


0 


(8.2.3a) 
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M^e knov that the Lorentz condition ^ related to the trans¬ 

verse nature of electromagnetic \vaves, so we might expect that Eq.(8.2.3a) will 
tell us something about the transversality of gravitational waves. In the 
electromagnetic case we have a four-vector potential A^ and the Lorentz con¬ 
dition BA /dx =0. At first sight this suggests three independent polari- 
sat ions for an electromagnetic wave — and we know there are only two (our 
martians of Chapter 3 probably found this out too). But consider a plane 
wave : 


A = a 


where a 


is a constant 4-vector. 


^i(k.x-u*) ^ ^ 


ik X 

. V V 


(8.2.4) 


and 




yields k k = 0 
V V 


yields 


But if a -> a + tk these conditions are still satisfied and so without 

li u , , 

loss of generality we can replace a by a where a = a + bk with b 

\J. |J* ^ \J» 

For a wave travelling in the z direction, the condi- 


an arbitrary scalar. 


tion k^a^ 


0 yields ^4 ^^3 • 


Then choosing a^ + bk^ = 0 we have 


ai = a^ 


= ®2 


and are left with only two polarisations, 
to setting 


a' =0 

a' = 0 
4 

Since 


A + - 

ji OT 


a + bk 


with B = - ib e 


ik X 
V V 


corresponds 


the equations of motion, determined by the fields 

BA 


E =- {vcp> jf] 


B = 7 X A 


do not notice the transformation a 




a^ + bk^ and indeed only two polarisec¬ 


tions are physically significant for a plane wave, and hence any wave. 


The potential transformation a -►a + bk is called a gauge transformation, 

|J. |J. 

Ll] , and is quite evidently intimately connected with the conservation of the 
source of the electromagnetic field. 
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With 


^3 = 


= 0 , 


hA 

TT 


and electrons exposed to the \iave vibrate in the x direction if driven by A^ 
only, in the y direction if driven by A^ only. In general of course the 
two components of the polarisation vector (a^ , a^) may have arbitrary magni¬ 
tudes and relative phase and.the familiar range of polarisation effects obtains. 
The vibration pattern essentially repeats under a rotation through 180° about 
the beam (with a change of phase of tt) and we talk about spin 1 photons with 
two polarisation states a^ , a^ or two helicity states ^ (a^ ± i a^) . 

The condition Bh /dx = 0 results in a similar behaviour in the case of 
PV PL 

gravity, and in fact this behaviour motivates the choice of the tensor theory 
rather than the Newtonian theory. 


Consider a plane wave 


h = e e 

piv liV 




In free space Q =0 so that 


k^kx = 0 . 


(8.2.5) 

( 8 . 2 . 6 ) 


h \ =0 

Sk IJV p.v oa J 


The condition 

yields 

If we now replace e bye^. = e +ek+ek (analogous to replacing 
^ pLV-^pLVpLVliVVpL^ ^ ^ ^ 

a by a^ = a + bk except that e has two indices and a has one) we 
pL-^pLpLpL^ PLV pL ^ 

find 


k G 
PL pLV 




k = 0 

aa V 


(8.2.7) 


k G^ - i: e' 


PL piV 


k=kG - i € k+kek 
2 era V PL PLV ^ gg V P pl v 


+ e k k - +k e k - +k e k =0 

V P P ^vgg ^vgg 


where the underlined pieces are zero by Eqs. (8.2.6) and (8.2.7). 


We can choose four quantities e arbitrarily and so go from ten independent 

P 

components of h through Eq. (8.2,3a) to six independent components, and 
through the gauge invariance to two independent polarisations in a plane wave, 
and hence any wave. These are conventionally chosen to be h^^ and h^^ 
a wave travelling in the direction. This can be worked out quite 

easily. 
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The condition (8.2.7) i ^ ^ ’ gives 

^22 ^33 ^44 


"3v 

(since k^ = k2=0) . 

V = 1 

V = 2 


)k =0 
/ V 


^3 ^31 ■ ~ ^41 


=*31 


- le 


41 


3 ^2 ' 


- luo e 


'42 


^32 " 


le 


42 


V = 3 : ikJe 


3'^'"33 " ^11 


- ^22 - ^ aa ) = - iWJ e 


^44' 


43 


"^33 - "^11 - "^22 


^44 = - ^43 


k- e. 


3 "'34 


+ luu e 


'44 


- 2 i^ + e^) - 0 


or (e^^ + 022 + £33 


^44) 


='22 " 33 

2i ^34 


44/ 


> (8.2.7a) 


We may rewrite the last pair of equations in the form 


hi + ®22 = ° 


®33 “ ®44 " " ®43 


(8.2.7h) 


These equations take ten conq)onents down to six independent components, and 
still hold when we replace with 0^^ 

0^=0 +ek+ek. 

1J.V p.V li V V \1 

Remembering that only k^ and k^ are non-zero, we find 


^11 “ ^11 


^22 “■ ^22 


H 2 “■ ^12 


Then 


'='31 


®31 + ®l‘"3 


^32 = ®32 + ®2'"3 


®41 = Hi ^ 

<2= "42 ^ ^‘""2 


®33 “ "33 ^"3*^3 


"34 = "34 + "4*^3 


'•43 


^43/ ^'"®3 

+ ^364 


644 = 644 + 2 iwe^ 


+ 1 uoe., 


( 8 . 2 . 8 ) 


Choose e^ , e2 , and to make the first column of (8.2.8) vanish. These 

are our extra four constraints, 
two rows of Eq. (8.2.7a) yield 


are our extra four constraints. With e^ and thus determined, the first 


= 0 . 


"'41 " "'42 

With 0^^ set equal to zero by appropriate choice of e^ and e^ , 0^^ i® 

automatically equal to zero. The second of Eq. (8.2.7b) then yields 6 ^^ = 0. 





108 


GRAVITATION AND RELATIVITY 


The only surviving terms are ; 

^11 ^ ~ ^22 ’ ^12 = ^21 • 

We thus have once more two independent polarisations which correspond to spin 
2, for the basic vibration pattern repeats after a rotation of 90° rather than 
180°, This is enormously important, because we know from special relativity 
that a massless particle has only two possible spin states, along and against 
the direction of motion, [2] . The photon is a massless spin 1 particle and 
now we infer the existence of the graviton, a massless spin 2 particle. 


In the half-tensor (Newtonian) theory, we have from source conservation 

. 


Bx 


Vi 


■= 0 or k e =0 
\1 IJIV 


If we set 


we find 


Viv 


k 6' 

\1 ^iV 


= e . + k. Cv. + k e 




V li 


= k, 0. + k k e + k k e 

\i yiv |ji yx V la V ^jL 


Only 


The condition k^e^= 0 is not automatically satisfied by any e^. 
three of the four quantities e^ are independent and so instead of eight con¬ 
straints on ten independent variables giving two polarisations we have seven 
constraints on ten independent variables giving three polarisations. The half¬ 
tensor (Newtonian) theory is thus a mixture of the tensor (spin 2) theory and 
the scalar (spin zero) theory, and in our apparently artificial construction 
of the spin 2 theory we were in fact subtracting off the scalar piece in the 
half-tensor theory. On the grounds of simplicity and elegance we would pre¬ 
fer the theory of gravitation to correspond to spin 2 or spin 0 rather than a 
mixture of the two, and in this framework the deflection of light by the Sun 
of twice the Newtonian value at last appears as an entirely natural result. 


8.3 Physical effects of gravitational waves 

In order to understand the content of the two physical polarisations we 
need to study the physical effects of the radiation. If a train of gravita¬ 
tional waves passes through a laboratory, what sort of physical effect does it 
produce ? The first thing that must be realised is that the local gravita¬ 
tional field is not directly detectable because we must look at the relative 
motion of at least two objects to detect an effect. Electrons in wires are 
sensitive to the electric field of electromagnetic radiation because in such a 
field the electrons go one way and the positive lattice goes the other. But 
everything accelerates the same way in the same gravitational field and so we 
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need two separated objects, with their relative motion determined not by the 
first derivatives of the potential but by the second derivatives — the tidal 
forces. 


We have therefore to take as our primordial gravitational antenna two (point- 
like) masses separated in space and measure their separation. Let us calcu¬ 
late the effect on this separation of a gravitational wave characterised by 

(*^11 ’ ^22 “ “ ^^12 ’ ^21 ~ ** 12 ^ ' 

We return to our equations of motion (5.4.4) developed originally for solar 
system problems : 


% 

at 


dh 


= m V V 
li V 


, ^ - mv V 

li V 


bx 

1 ^ 


(5.4.4) (8.3.1) 


At this point we encounter what is at first sight a serious difficulty. A 
mass at rest has all elements of T equal to zero except for T^ . Our 
wave however is represented only by h^^ , h^^ > ^12 ’ ^21 therefore imping¬ 

ing on a mass at rest it gives 


% 

"dt 


= 0 


(8.3.2) 


This however is as seen from the ideal coordinate frame we have established 
free of the effect of all gravitational fields. In this frame two masses 
initially at rest suffer no acceleration. But it is not clear that this is 
the case from the point of view of the observer with a measuring stick 
stretched between the two masses. Remember that in time independent poten¬ 
tials, the tensor, half tensor and scalar fields gave the same h^ and slowed 
down clocks by the same amount. However, the h^^ components had opposite 
sign for the tensor and scalar theories, respectively shrinking and expanding 
measuring sticks, while the components were absent in the half tensor 

theory and measuring sticks were unaffected. We guess therefore that it is 
the h. . components that alter the lengths of measuring rods an(t that in the 
ideal frame the two masses do not move but rather the length of a measuring 
rod next to them fluctuates. The local observer however has no way of tell¬ 
ing his measuring rod is fluctuating in length and so interprets his observa¬ 
tions as meaning that the separation of the two masses is fluctuating due to 
a periodic tidal force exerted in the passage of a gravitational wave. This 
interpretation is indeed implicit in the more formal developments of section 
6.5. 

Take equation (8.3.1) and set x^). Then we have 
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% 2 ^*^11 
“dt "“^1 dx 


hh 


11 




dt 


I = -“v2 7hjj-mvj{v. 7 
Bh. 


hh 


7h,i. 


11 


St 


— - 

at = - ®^l 


^11 


(8.3,3) 


(8.3,4) 


at 


for the motion of a particle of mass m . In interpreting these equations we 
must first find expressions for the momentum and energy. Consider motion in 
the x^ direction. The first thing we notice is that dE/dt is a function 
E is not changed in the slow motion limit. This suggests writing 


of vf 


when 


E = 


En 


^/lT 


vf k 


dE -^o / **^1 ? dkl 


Then to first order in small quantities, 




dk 


^1 at ^1 at 

1 - V? 


Sh. 


11 


dt 


We also have 


<^3 2 ^ 

at - Bxj 


(8.3.5) 


(8,3.6) 


A light ray along the direction is therefore deflected in the x^ direc- 

tion by an amount depending on the gradient of h^^ . In a wave picture this 
requires an acceleration in the x^ direction. If we set 


k E Vj = Pj 


then 


^1 ^Pl dE dk f 

‘dr = 'dr“^iH“^^i^n-^i 


dh. 


11 


dt "1 

which on multiplying by v^ yields (8.3.5) again. 


dh_ 
,5 11 


at 


ak-i 

“"i at) 


The refractive index necessary to produce the deflection in the x^ direction 


implied by Eq. (8.3.6) requires 


and hence 


Cj = (l-hjj) 
k = 1 + hjj . 
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For slow motion 


at 


ah 


11 


1 at 


at 


•^ = 0 


ana 


E = 


/1-k 


2 „2 


Pi 


kE V. 
o 1 

'\/l - k^ vf 


(8.3.7) 


In the presence of other forces 

ap^ 

"oT 


- Pi ~w~ ^1 


dE T. 

at =^ 1^1 

ana following the arguments of section 5.5 we make the new aefinition 


(8.3.8) 


Pi ^ +^ll) Pi = 


Vl - V? 


Thus the rest energy is unchangea, the velocity of light in the airection 

is c^ = ana the rest mass + • The quantity m^^c^ 

is the locally aefinea rest energy ana is inaeea unchangea. Since energy is 
unaffectea, atomic frequencies are unchangea and the charge in the velocity of 
l ight therefore implies that a length is changea by a factor T-h^^. As 

a final check, for one aimensional motion in a Bohr atom we write 

2 

1 ? e 

^mvj - — = constant 
^ 1 X 

1 Xi 

ana this is inaeea satisfiea to first oraer in h^^ if x^ 

e2 e2/(l + hji) . 


We have then a consistent picture in the spirit of section 5.5. The effect 

of gravitational waves on the separation of two masses, initially at rest, is 
to proviae a relative acceleration 


a^x. 


at^ 




JJ. 




(8.3.9) 


where x? is the separation in the j airection in the absence of the wave 
J 

ana all measurements are maae locally. 


8.4 Polarisation properties of gravitational waves 

The two inaepenaent polarisations are given by (h^^ ana 

(^12 “ ^ 21 ^ • 
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We consider a set of four particles, arranged at the corners of a square nor¬ 
mal to the direction of propagation of the gravitational wave. This is the 
minimum set of particles necessary to bring out the general features, since 
we need two particles to detect any effect and each polarisation affects 
length measurements in two orthogonal directions simultaneously. We obtain 
the sequences shown in Fig. 6.4.1. 


1 

• 

1 

• 

<-• 

1 

• 

hi, = 

<-• -j- 
1 

-h22 

• + 

• 


• 

<-• 

• 

14-1 

• 

4- 

• 


* + . 

<-• 

• 

hi2 = 

^21 

• 


Fig.8.4.1 Positions and velocities of four test particles 
in a plane normal to the direction of a steady incident 
gravitational wave, as observed locally. The configura¬ 
tions are shown every quarter period 

For a general wave we can combine these two polarisations with any magnitudes 
and relative phase. It is clear that these vibrations are quadrupole in 
character and that the elementary mode of vibration induced by a gravitational 
wave is quadrupole, just as the elementary mode of vibration induced by an 
electromagnetic wave is dipole (dipole oscillations cannot exist for gravity 
because there is no negative mass and hence no gravitational dipole). 


8.5 Detection of gravitational waves 
Our elementary gravitational 
antenna will therefore consist of a 
pair of masses M, connected by a 
spring if we want to make a resonant 


Fig. 8.5.1 

(Fig. 8.5.1). The equa¬ 
tor the case where the antenna is 


antenna, with some means of measuring the separation, 
tion of such an antenna can then be written, 
aligned with one of the principal stress axes , 
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where x 

o 

ration. 


d^x dx 2 

— p •+ y — + X 
dt^ dt o 


at^ 


is the rest separation of the masses and 


At fixed frequency 


U)^h : 


0)^ - UD^ - io) Y 
o 


X 


(8.5.1) 

is the change in sepa- 

(8.5.2) 


The spring constant controls the resonant frequency and the damping constant 
Y the Q value of the detector. A small value of Y gives a narrow band 
antenna, with a high Q, while large Y increases the bandwidth. If the fre¬ 
quency spread in a gravitational wave is « Y» sensitivity of the detector 

to frequencies around o)^ depends inversely on Y • "the gravitational 

radiation is broad band, however, the energy absorbed is independent of Y* 


We should also note that an antenna of this kind has directional properties. 

The quantity hx^ varies as cos 2 cp or sin 2 cp as we rotate the axis about 
the direction of propagation of the wave: measuring (p from the x-axis the 
former result is for (h^^ , h^g) fhe latter for (h^^ i ^ 21 ) • change 

the angle between the axis of the antenna and the direction of propagation, 
the tidal forces act on the projected separation L thus introducing a term in 
sin 9 . The tidal force hLsin9 acts normal to the direction of propagation, 
but longitudinal oscillations of the detector of amplitude L are induced by 
the component of this force along the detector axis, introducing a second fac¬ 
tor of sin 0. Therefore 


■ I p«i I f\J 

dt^ 


h L sin^6 cos 2 (p 


for a polarisation (h^^ , ^ 2 ^ ‘ illustrated in Fig.8.5.2. 


hZ sinScos 2<#> 



Fig. 8 . 5.2 The angular response of an elementary gravitational 
wave detector. The detector shown in the figure makes an angle 
9 with the direction of propagation of the wave and has an 
azimuth measured from the h^^ direction. The wave shown is 
characterised by (tint ^22 ~ 
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The detection of gravitational waves is discussed in detail in the recent books 
by Misner, Thorne and Wheeler, and by Weinberg, [3], Here we will only work 
out a few useful results. We take the case of a detector aligned with one of 
the stress axes of the gravitational wave, so that 

0 ? h X 4 a) hx 

o _ o o 

X = — - . 

0)^ - 0)^ - i Y UI) 03-0) + i 

o o 


2 


(8.5.3) 


Then , 4 i h x 

dx o o 

dt "" . Y ' 

0 )- 0 )^ + 1 ^ 

(it should be remembered that in these equations h has the dimensions 
that is, it is dimensionless.) 


(8.5.4) 

GM 

Lc2 ’ 


Locking ourselves firmly into the local system of coordinates defined by real 
measuring rods, the steady state vibrational energy of the antenna, in a mono¬ 
chromatic wave, is 




E 


.a- 

: 2iM( 


max 

2 J 


\z j, Ma)'^h^x^ 

r - -1. o Q o 
" 16 


(o)- 0)^)^ 


The rate of dissipation of energy is E^t, where 
absorption of energy from the wave must be 


T = 1/ Y , 


(8.5.5) 


SO the rate of 


J[_ 

16 


M Y U)^ x^ 

0 0 0 


(id- id„)2 + ^ 


(8.5.6) 


Averaging Eq. (8.5.6) over a broad frequency band gets rid of Y* Now the 
quantity ^ 


has the dimensions of energy flux, 
as 


We rewrite the energy absorbed each second 




M Y iiJ 
' o o 


G 


, .2 L‘ 

(id- UJ„) + -t- 




(8.5.7) 


The left-hand box has the dimensions of an area and contains quantities charac¬ 
teristic only of the detector. It must represent the cross-section of the 
detector for the absorbtion of energy from the wave. The right-hand box has 
the dimensions of an energy flux and must represent the energy flux in the wave, 
down to a numerical constant. 


We may tentatively guess that the instantaneous energy flux associated with 
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8 ttG \v^ / ''’Vdt/J' 

I^This makes some sort of sense in the light of our knowledge of Newtonian 
gravity. The gravitational self-energy can he written as 

- ra J ® 

with h having dimensions GM/R , and with h dimensionless as 


8 ttG 


J ^ •] 


With -h„«=h..=h , this gives us on time averaging an energy 

-Li- O 

flux 


= 


h2 ^ 


G ~ 8 ttG o 

with h dimensionless, or if h has dimensions GM/r , 

h2 0^ . 


G ■ 8 ttG o 


(8.5.8) 

(8.5.8a) 


With this tentative identification (note that we have attempted no proper deri¬ 
vation of the energy flux in gravitational radiation) we find the cross-section 
of the simple antenna we have been considering to be 


TT G 




(uu- U) 


(8.5.9) 


for a single frequency near resonance. In Eq. (8.5.9) we have suppressed the 
angular dependence of the cross-section, which may be inserted by multiplying 
by a factor sin"^ 9 cos^2 cp if cp is measured from one of the stress axes of 
the radiation. 


For a broad frequency band the power absorbed each second is thus 

'^^GMtJu^ x^F(a)) (8.5.10) 

^3 0 0 o 

where F((ju) d uu is the energy flux in the incident radiation between U) and 
UO + duu. 

We are now in a position to consider briefly such experimental results as 
exist at present (1975) but to set the problem in perspective we first discuss 
the generation of gravitational waves and possible astrophysical sources of 


such radiation. 
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8.6 Generation of gravitational \iaves 

We \\?ill consider briefly gravitational quadrupole radiation: much more 
detailed dispussions may be found in Misner, Thorne and Wheeler and in 
Weinberg [4] . 


The solution (8.1.1) of the inhomogeneous wave equation is 

- (r' » t - Ir-r'l) 


h (r , t) = 2G f 3: 


3^' 


For r » r' we can write this 


IJV 


d'^r 


Vt'*) “f J k-i’D-'r' 


( 8 . 6 . 1 ) 


For a fixed frequency __ 

3^ (rt) = S' (r^) e 

M-v * liv 


-i(Dt 


and if the wavelength is large in comparison with the dimensions of the source 
the correctio] 
nored, giving 


the corrections to the phase of h (jr , t) due to variation of may be ig- 

|»aV 


jiV — ' r J ' 


( 8 . 6 . 2 ) 


We expect this approximation to correspond to quadrupole radiation, just as 
the equivalent approximation in electromagnetic theory corresponds to dipole 
radiation. 


We have 


h (r, t) ei(kr-(«t) ^ 


where 


IJ.V'—' ' r |iv 

T, = [ 7 d^r' 

and we are interested in h^^ , ^22 ^12 * 

T. . = f 3. .(r') d^r' = v. v. M 
for a particle of mass M. For a single fixed frequency 


T. . ^ - UU^ X? X? M 


and so 




1 J 


h..(r,t) «_Gei(kr-U)t) 2 0 

ij v_ 1 / r 


(8.6.3) 


(8.6.4) 


where is the anplitude of the quadrupole fluctuation. 


If we work in units with h dimensionless, and write in c explicitly 
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^ G i(kr-a)t) 0)^ 

h. Jr , t) ^ ~ e ^ 4 • 


r ^4 ij 

The power radiated at the frequency uu is thus 

P -g 4 iQl^ 


where Q is the quadrupole moment. 


(8.6.5) 


( 8 . 6 . 6 ) 


We can make a few numerical estimates. Suppose we have a close binary system 

2 

with a period T ^ 10 hours, the mass of each member being Then Q'^M^R , 


where 


and 


GM T 


2 1 


/ A X V ^ 

R ss ■ —2^) « 2 X 10^^ 

2 TI^ 

Q ss 10®® gm cm^ , uu ~ 3 X 10“® s ^ , 
P « 10^° erg s”^ 


which is lO-"^ of the luminosity of the Sun. The energy of orbital motion 
is ^ 10"^® ergs, and so the decay time due to radiation of gravitational waves 
is « 10^® sec « 10® years. This is a perfectly realistic example (the system 
containing the pulsar PSR 1913+16 has very much these characteristics). 


As a more extreme and less realistic system suppose there to be a pair of neu¬ 
tron stars, masses but separated by only « 10^^ cm . The period would be 

« 10"^ sec, Q « 10^'^ gm cm^ and the power radiated « 10^^ erg s“^. The energy 
in orbital motion would be 10^® erg and the corresponding decay time « 1 year 
(This is only one of the reasons why this example is unrealistic.) 

Finally, suppose we have a neutron star rotating at a frequency uo 10"^ s ^ . 
This kind of frequency results if you consider a star with the mass, radius 
and rotation period of the Sun collapsing to a radius 10^ cm with internal 
conservation of angular momentum. A similar result is obtained by extrapo¬ 
lating back the slowdown rate of the pulsar NP 0532, in the Crab nebula, to 
its birth, 1054 A.D. (The present pulse period is 33m s.) Then 

/ P 10"®® |Q J erg s“^ 

and Q lO"^® e where e is an ellipticity factor measuring a departure from 
rotational symmetry. 

P 10®"^ e^ erg s”^ 

and the rotational energy is 10®® ergs. The conditions described are close 
to breakup and so in the early stages of neutron star formation one might get 
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10^^ ergs s"^ for around 1 sec . If the neutron star avoids breakup and 
settles dora with e ^ 10““^, then 

P 10^® erg s“^ 

and the slowdown period is p«1 year. 

We may compare these figures with the total energy released in gravitational 
collapse of a star like the Sun to nuclear densities where R 5==^ 10^ cm . This 
is ^ 10^^ ergs . The conversion of 1 into gravitational energy would 
yield 10^"^ ergs . 

It would seem that we have no right to expect sources of gravitational radia- • 
tion producing more than 10^"^ erg over 1 sec or more than 10"^® erg s”^ 
over 1 year, at least if we restrict ourselves to simple stellar systems. 

8 .7 Attempts to detect gravitational waves 

Suppose we have a burst of gravitational radiation lasting 1 sec . 

The energy absorbed in a simple resonant antenna is 

Pj = j Pdt 10"^° GMii|2x2f((d^) (8.7.1) 

where M, uu and x are characteristics of the antenna, 
o o 

The first attempts to detect gravitational radiation were made by Weber [s] 
who employed as detectors several aluminium cylinders in Maryland and a second 
array at Argonne, some 1000km away. 

These cylinders have a mass of 1.4 X 10^ gm , a length of 153 cm and the natural 
frequency of the lowest longitudinal vibration mode is 1660c,p,s. For such a 
cylinder 

Pj « 10 "^® F((d^) erg . 

If we take as a I'ough guide to the signal discernable above noise 

Pj ^ kT « 5 X 10-^^ erg 

for a noise temperature of 300°K, then for a pulse to be detectable 

^ lO"^ erg cm”^ s"^ per cycle. 

The total energy flux would have to be ^ 10^ erg cm~^ s”^ over the period of 
the pulse. If the source were located at the galactic centre and radiating 
isotropically, this means an energy release ^ 10^^ erg to make up the pulse; 
not impossible, but nudging the upper limits we worked out in the previous 
section. 

Suppose on the other hand I660c.p.s. is essentially on frequency for the 
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rotation of a collapsed deformed star. The power picked up each second then 
depends on the Q value of a resonant antenna: for Weber's cylinders the con¬ 
stant Y ^ 0.1 s”^ . 

The energy absorbed each second is ^ (lO E erg, where F is the inci¬ 

dent flux. If the steady state vibrational energy ^ kT for detection of this 
signal, and the natural decay time of vibrations in the cylinder is 10 sec , 
then F « 10"^ erg cm~^ s”^ . This corresponds to an isotropic source at the 
centre of the galaxy radiating 10^® erg s”^ . 

We may conclude that if we can pick up induced vibrations at this noise level, 
catastrophic events near the centre of the galaxy are detectable, but steady 
single frequency signals of the kind that might be produced over the first few 
years after neutron star formation would have to have a fairly local source, 
say within 5,000 light years. 

It is salutory ttXNcalculate the amplitude of vibration corresponding to an 
energy 10~^^- j ergs . 

10"^^- 10-^^ Mut^ (Ax)2 «10^^ (Ax)2 

so Ax 10“^^ - 10“^'^ cm . This is less than the diameter of an atomic nucleus. 

The corresponding strain is 10“^^ - 10^^^ . There are in fact several ways of 
measuring displacements and strains of these sizes. Weber's detectors utilize 
the piezo-electric effect, with piezo-electric transducers glued to the sides 
of the aluminium cylinder. In other similar antennae, two half cylinders are 
bonded together via a slab of piezo-electric material. The piezo-electric 
first used by Weber was quartz, which develops 0.16x Coulomb surface charge per 
square metre for a strain x . This corresponds to an electric dipole moment 
per cubic centimetre of 10~^ e-cm for a strain of 10“^® : a dipole moment per 
molecule 10“^^ e-cm . Such a dipole moment means a separation of unit elec- 
tron charges of ^ 10 atomic diameters, a fractional displacement of charge 

In Weber's later work he used ceramic barium 
titanate transducers : 44 of dimensions 

5X5X1.2 cm^ bonded to the equator of the 
cylinder, where the strain is greatest [6]. 
With a surface area of 200 cm^ the total 
charge produced for a strain of 10“^® is 
« 10-^9 Coulomb, one electron charge. This 
fluctuating charge was measured from the 


of the same order as the strain . 


Antenna 



G.R.—E 
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voltage developed across a high impedance: Weber [?] gives the equivalent cir¬ 
cuit of Fig.8.7.1, with C = 6X10”® f , L = 1.6 henry, R = 0.1 ohms. The 
impedance of this parallel tuned circuit is 

R + i UOL 

Z = - r - 

RiuuC-uu'^LC + l 

which is approximately l/RC on resonance, ^ . With the above values, 

U)j^ = lO'^ c. 

Since V = I Z = iu)q Z the amplitude of the voltage developed is 10“® volts 
for a charge of 10”^® Coulomb. This fluctuating voltage is then fed through 
a low noise amplifier. 

Another technique that has been used is to sense the relative displacement of 
the ends of a cylinder through changes in the capacitance of a parallel plate 
capacitor with one plate linked to each end. A relative displacement of the 
ends of amplitude 4 X cm has been sensed as a radio frequency voltage 

of amplitude 4 X 10”^ volts, [s]. 

Displacements « 10"^^ cm have also been measured using lasers. Timing pulses 
is limited to 10 cm accuracy, measurement of fringe shifts in optical inter¬ 
ferometry to 10“^ cm , but by picking out the beat frequency between light 
reflected back over a fluctuating path length and light which has travelled a 
fixed path, very small amplitudes can be measured. A SOOOc.p.s. vibration 
of amplitude 3 X cm has been measured in this way [o]. This was done 

with a laser beam, the two paths being arms of a Michelson interferometer. 

After recombining the split beam the amplitude is 

A'^sinuut + sin(uut+cp)^ (8.7.2) 

where cp is the phase difference between the beams that traversed the active 
and passive arms of the interferometer. 

cp = -^ x(t) + cp^ 

where x is the instantaneous amplitude of the vibration. The recombined 
beam is detected photoelectrically and the output of the photodetector is pro¬ 
portional to the square of Eq. (8.7.2) which after averaging over the optical 

frequency vibrations is —. ■ 

2 A^ sin^uut [l +.cos cp ] . 

For maximum sensitivity, choose cp = tt/ 2 \dien the beat signal is 

2 A^ sin^ U)t sinf ^ x(t)l . 


,s. , = 1660c.p.s. and = 3X10 


ohms . 
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Let x=x sin (ju't and for small x /X the heat signal is 
o o 

^ 2A^ sin^iut ^ X sin U)^t 
X o 

and the amplitude of the heat signal yields The limiting factor in 

the work of reference [ 9 ] was photon noise. 

Weber [5,7] has reported coincidences between his Maryland and Argonne arrays 
at the rate of '^100 per year, apparently explicable only in terms of pulses 
of gravitational radiation. In 1973 he reported seven events/day above a 
level of kT/lOO, [lO]. He has used the sin^9 dependence of the cross- 
section to get direction information, finding the rate to be correlated with 
sidereal time (but not with solar time). Weber claims a clear excess of 
pulses travelling along a line joining the solar system with the galactic cen¬ 
tre [ 11 ] (with a simple quadrupole antenna it is not possible to tell which 
way such pulses are travelling). 

These results are astonishing. Supernova explosions are observed from Earth 
in our galaxy every few hundred years. From observations of other galaxies 
a total rate of about 1 supernova in 30 years has been inferred. For iso¬ 
tropic radiation, 100 detectable pulses each year from the centre of the 
galaxy implies an energy output > 10^^ erg each year. This is greater than 
the energy radiated as starlight in the galaxy, and means the conversion into 
gravitational waves of >100 each year. The galaxy contains '-lO^^ stars 
and has existed for 10^° years: this is a level at \diich galactic dynamics 
should be visibly affected, [l2]. 

If Weber’s interpretation of his data is correct, it would seem that either 
the radiation must be strongly beamed in the galactic plane or that activity 
in the core of the galaxy has only recently switched on. In either case the 
core of our galaxy must be a far more violent region than we had any prior 
reason to suspect. 

A number of other detectors of gravitational radiation have been brought 
into operation over the last few years [8,13]. In no case has any indepen¬ 
dent experiment confirmed Weber’s exciting observations, which accordingly 
must be regarded as far from proven. 

At present gravitational wave detectors are probing into the region of sensi- 
-^j^-y'ity to strains ^ 10”"^ and the ultimate hope is to achieve a sensitivity 
sufficient to detect cataclysmic events occurring in the rich Virgo cluster 
of galaxies. 
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CHAPTER 9 

GRAVITATION AND THE GEOMETRY OF SPACETIME 


9.1 Introduction 

Throughout most of this book M^e have been \vorking in the standard 
coordinate systems of special relativity. We set up the Lorentz covariant 
equations for the tensor gravitational field and calculated the deflection of 
light relative to such an Olympian reference frame, and the radar echo delay 
due to the interaction of electromagnetic waves with the.gravitational field. 
The equations of motion that were forced on us required a redefinition of 
momentum in addition to the obvious redefinition of energy in the presence of 
gravitation, and we then discovered that a gravitational potential distorts 
physical measuring rods and slows down clocks, relative to our Olympian refer¬ 
ence frame. In a freely falling laboratory of such a size that changes in 
the gravitational potential over the laboratory could be neglected, we found 
the velocity of light, as it would be measured, to be constant. We also found 
that the kinematic equations of special relativity recover their standard form 
when everything is measured in local terms. As a consequence the principle 
of equivalence holds (at least for light and relativistic kinematics) and the 
Olympian reference frame is locally unobservable. 

In these circiimstances we may either adopt the point of view that gravity 
bends measuring sticks and slows clocks in the spacetime of special relativity, 
or we may consider these instruments as embedded in a spacetime which is itself 
distorted by the presence of mass. From the former point of view it might 

prove advantageous to formulate the theory of gravitation in terms of the 
locally measured quantities rather than sitting in an unobservable reference 
frame: from the latter point of view it is the obvious way to proceed. 

The principle of equivalence is put in as the starting point, in the following 
form. In a sufficiently small region of spacetime, the laws of physics in a 
freely falling laboratory are the laws of special relativity, including their 
numerical content. The laws of physics in any other reference frame — a 
reference frame locked to a gravitating body will usually prove most conve¬ 
nient — may be obtained by a coordinate transformation alone, and this trans¬ 
formation is linked to the gravitational fields. 
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The Lorentz transformations themselves link any two inertial frames and do not 
contain any gravitational attributes. In an accelerating frame inertial 
forces appear which are reflected in the transformations: the recipe is to 
transform from a given inertial frame to the instantaneously comoving frame. 
These inertial forces may be regarded as gravitational forces, but of a parti¬ 
cular kind. There exists a transformation (back to the original inertial 
frame or any other) that cancels them completely, everywhere and everywhen. 

No such transformation exists for permanent gravitational fields. A space¬ 
craft in free fall close to the earth has an acceleration ^g, as measured in 
an earthbased system of coordinates. In a frame of reference falling with the 
spacecraft there are (locally) no gravitational effects. The two frames are 
linked by a coordinate transformation. But if one spacecraft is falling to¬ 
wards the Atlantic Ocean and another towards Siberia, the two associated freely 
falling frames are approaching each other at '^20kms’’^. In one freely falling 
frame the other spacecraft appears to be accelerating at '^2g, due to the gravi¬ 
tational field of the earth. It is when permanent gravitational fields are 
present that we speak of a curved spacetime, for whatever global transformation 
of coordinates we make, we cannot globally get rid of gravitation. 

9.2 The metric tensor and equations of motion in free fall 

We shall be considering particle dynamics and the propagation of light 
in gravitational fields. Consider a freely falling particle as viewed from a 
local freely falling frame. The principle of unique acceleration tells us 
that any such particle is not accelerating in the local freely falling frame. 

Set up in such a frame the standard axes of special relativity X = (x,y,z, ict) 
where |J. runs from 1 to 4. Lack of acceleration is written as 



(9.2.1) 


where j runs from 1 to 3. The proper time elapsed in the rest frame of the 
particle is a Lorentz invariant given by 


dT^ =- dX dX 

1-L 


(9.2.2) 


with c=l. Because there is no acceleration an interval of proper time At 
is strictly proportional to an interval of coordinate time At in a local 
freely falling frame, so Eq. (9.2.1) may be written in the form 



(9.2.3) 


It is convenient to redefine the coordinates in a freely falling frame so that 


?a= (ct , X , y , z) 





GRAVITATION AND THE GEOMETRY OF SPACETIME 


125 


Tvhen have in place of Eq. (9.2.3) 




(9.2.4) 


'with the invariant interval, \^hich corresponds to proper time on the particle, 


given by 


^ 

= - 1 


a = g = 1,2,3 
a = g = 0 


= 0 g 


(9.2.5) 


We noM make a transformation to another reference frame in ivhich the particle 
coordinates are x , ^where a no^ runs from 0 to 3 . We do not at this 
stage attach any significance to this ne\] frame of coordinates. The trans¬ 
formation might denote something as trivial as a redefinition of the coordi¬ 
nates in the original frame, for example x^ = ta if there was some point 
in doing this. (This sort of transformation is made whenever logarithmic 
graph paper is used.) The transformation might merely be to another frame 
related to the original by the Lorentz transformation, to an accelerating 
frame in the absence of permanent gravitational fields, or to a frame of 
reference locked to a gravitating object like the Sun. So far, it does not 
matter. The prescription that we follow is that the transformed physical 
laws are still valid and with the same numerical content. In particular, an 
invariant such as proper mass or an interval of proper time still has the same 
numerical value. Physical laws which maintain their form and niimerical con¬ 
tent under such general transformations are generally covariant (as opposed 
to merely Lorentz covariant) and this is the origin of the term general 
relativity. 


Either the old set of coordinates ? or the new set x are sufficient to 

a IJ. 

describe completely the motion of a test particle. We may therefore write 

Cl§ = dx . 

a la 

This is just an expansion in terms of partial differentials. 

= - g dx dx 
ii V 


(9,2.6) 

Therefore 


dT^ = 


where 


(9.2.7) 
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®uv = sT ^ 

^ ^ V 

The quantity g is clearly symmetric. It is important to note that the 

[J-V 

quantity dj is generally invariant, having the same value in either the 
§-frame or the x-frame. For example, in an inertial frame (a local freely 
falling frame) the invariant interval is zero for light. In an arbitrary 
coordinate system therefore the propagation of light must folloTV the equation 


g dx dx = 
^\1V l-L V 

and this is a generally covariant equation, 
transform Eq, (9.2.4) as follows 

d^? 


0 

With 


aT2 


A 

rd? ... 

{_cc t 

-11 

_a 

dT 

1 dT j 

“ dT ^ 

. dx 



14 


d^x 

dx 

i' 

a 

_ 


a 1 


dT^ 

^ dT 

dT 1 

a? 

d^x 

dx 

dx 

a 

u 


V 


dT^ 

11- 
1^3 
■f 

dT 


dT'^ an invariant, we may 

dx ^ 
d.T J 


1-L 


a 


K 'dx 

V 


= 0 


This is again just an expansion in terms of partial differentials, remembering 


that neither § nor x are explicit functions of T : § 


X = 

Id 


s (5 ) . 

la ^ o: 


Id 


? (x ) or 


We now have a set of four equations relating the four components of four- 
acceleration to the four-velocities and the partial differential coefficients. 
We may solve them simultaneously to obtain the four-acceleration. Multiply 
each of the four equations, labelled by the only index which is not dummy, a, 
by the quantity where X has any value from 0 to 3 , and add all 

four equations. This operation is easily written using the convention of sum¬ 
mation over repeated indices 

^ 

Now note that 


dx / dx,\ 

-4 — \ - 

aij jax.. ax. 


= 0 


(9.2.9) 




dx 


= 6 , 


Xid 


because each coordinate in one system is a function of the four coordinates in 
the other system. Thus on summing over a we only retain the term for which 
jjL = X in the left-hand piece of Eq. (9.2.9). Then 





GRAVITATION AND THE GEOMETRY OF SPACETIME 


127 




dr 




Bx 




(9.2.10) 




dT^ = - g dx dx 
^liV li V 


The equation (9.2.10) contains a single coefficient composed of partial differ¬ 
entials and ve •write 


d^x 


dT^ 


\ -pX 

+ i 


dx 


dx 


X ^ ^ = 0 (9.2.11) 

IJLV QT dT 

•where the coefficient is a function only of the coordinates (xq,x^,X 2 ,x^) , 

\J>V 


Now we notice that 
2 g 


^ 


(9.2.12) 


and is clearly related to the F's . It looks as though the F's can he 
expressed in terms of the quantity and its first derivatives and we can 

at once anticipate that the g can he interpreted as gravitational poten- 
tials and the F's as gravitational fields. 


With 


^ _ 


= 

^IV 


•we may multiply by d5^/dx^ and summing over p on the left-hand side before 
attempting to sum over CX have 


so that 


a we have 


!!i 


_ ^ 

dx 


■ 

P 

a 

a 





Y 

Y 


dx dx 

dx 


ay 


IJV 


Substituting in Eq. (9.2.12) 


iV 




y + g rf 

pv Xia pp 


(9.2.13) 


The only dunmy variable is p, so Eq. (9.2.13) is shorthand for a set of 64 
equations relating the 64 first derivatives of g^^ to the 64 quantities F. 
(Because symmetric with only ten independent components we actually 

have 40 equations relating 40 derivatives to 40 independent F's.) These 
equations may he shuffled around, til , to yield 






(q 9 1 Ai 
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where the quantity g 
We thus have 


is the inverse of g , defined hv = 6 

^OV’ ^ ^ ^ap vp 


dr 


.2 - 2g I 


d^x. 


dT^ 



(9.2.15) 

(9.2.16) 


At this stage we may throw away the partial differentials involving the 
§’s , which are only locally defined, and the equations of free fall are ex¬ 
pressed in terms of the purely geometric quantity g^^ and its first deri¬ 
vatives. The quantity g is called the metric tensor and the quantities 
X 

the affine connections. 


Now Eq. (6.4.11) may be rewritten with the same indices as (9.2.16) for easier 
comparison. Eq. (6.4.11) becomes 


<V * “(*> 


d^x. 


dT^ 


i 


dh, 


dx 


% Zv 


3x J dT 


dx 

V 
dT 


(9.2.17) 


Equation (9.2.16) is written in terms of purely geometric quantities and bears 
a great resemblance to Eq. (9.2.17) written in terms of gravitational poten¬ 
tials. If we were to set 

g . = 6 + 2 h , 

pX pX pX 

the equations would be identical. It must be remembered that Eq. (9.2.16) 
follows the convention 

x^ = (ct , X, y , z) 

while Eq. (9.2.17) follows the convention of special relativity 

x^= (X,Y,Z,icT). 

We could clearly have followed the latter convention in the preceding work, in 

which case in the absence of gravitational fields g -> 6 rather than 

p.v 


These two equations are therefore identical when written using the same conven¬ 
tions and in the ict convention identifying 

g =6 +2h 

liv liv 

(We could equally well rewrite Eq. (6.5.11) and related equations in the 
convention.) The terms of the metric tensor thus represent gravita¬ 
tional potentials and their derivatives gravitational force fields : the work 
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of this section and that of section 6.5 tie nicely together. We have hoivever 

no recipe so far for finding the g corresponding to a permanent gravita- 

|J.V 


tional field. 


9.3 Concerning the field equations 

In Chapter 4 we were faced with the problem of finding the gravita¬ 
tional potentials within the framework of special relativity. We solved this 
problem by searching for Lorentz covariant equations with the right Newtonian 
limit, and had then to determine the final form by appealing to the measurement 
of the deflection of light by the Sun. The tensor theory we constructed turned 
out in Chapter 8 to have a gauge invariance property and correspond to spin 2 
gravitations. The Einstein field equations for the g^^ are determined by 
requiring that the energy-momentum tensor shall be the source and that the 
right Newtonian limit obtains. The equation must therefore be something like 

IGttG 




. S', 


(9.3.1) 


C- liV 

where 3 is the energy-momentum tensor of everything except gravitation and 

is the equivalent in general coordinates of the d’Alembertian operator in 

Cartesian coordinates. The ? recognises the probability that the left-hand 

side will also contain products of the derivatives of g , since the gravi- 

H’V 

tational field energy is a source of the gravitational potential. The princi¬ 
ple of strong equivalence is then put in by requiring that the equations 
determining the gravitational field shall themselves be generally covariant, 
like everything else. There is then only one possible form for the left-hand 
side of Eq. (9.3.1) which gives the correct Newtonian limit . Furthermore, 
in a local freely falling frame where no gravitational effects are apparent, 

3 is conserved. This requires the left-hand side of Eq.(9.3,1) to satisfy 
P-V 

four identities and this is only possible with one ratio of the two pieces. 
The set of ten independent equations (9.3,l) (l6-6 because 3^^ is symmetric) 
with the imposed identities now have four degrees of freedom such that if 
g ^(x) is a solution, so is where “ay he computed from 

g (x) and any specified transformation of coordinates x -* Such a trans¬ 

formation relates every x^ to the corresponding set of x^ and of course can 
never dispose of a permanent gravitational field everywhere: it corresponds to 
a redefinition of coordinates in a specified curved spacetime. This condition 
is necessary for general covariance of Eq, (9.3,l), which must, if the principle 


A term Xg could be added but there is no evidence for the existence 
PV 

of such a term. The quantity X is known as the cosmological constant. 
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of strong equivalence is true, be equally valid in any coordinate system. In 
our flat space theory this condition appeared as gauge invariance. The prin¬ 
ciple of general covariance applied to the gravitational field equations, 
strong equivalence, is sufficiently po\\?erful first to require field equations 
reducing in the weak field limit to the tensor theory (thus predicting the de¬ 
flection of light by the Sun rather than using it as an input) and secondly to 
determine without ambiguity the full field equations including the nonlinear 
effects due to gravity being a source of gravity. The substantial mathemati¬ 
cal development necessary to handle all this may be found in many places [2]. 
It should however be clear that for the solution of a problem in gravitation 
(for example, planetary motion or the deflection of light by the sun) it is 
first necessary to choose four conditions which effectively specify the coor¬ 
dinate system in which g is expressed. The motion of a particle or a 

H'V 

photon can then be tracked in this coordinate system through Eqs.(9.2.15) and 
( 9 . 2 . 7 ). The coordinate system however has arbitrary elements and in any case 
is not locally observable. The rest of this chapter is concerned with what 
the equations of motion in such coordinates actually mean, and how to predict 
the results of real physical measurements. 


9.4 The metric tensor in some simple situations 

We want to relate the coordinate system in which g 


of motion are expressed to real measurements, 
interval 


iiV 


and the equations 


We employ for this the invariant 


dT^ = 


ff dx dx 
^ V 


(9.4.1) 


In an infinitesimally small region of spacetime labelled with coordinates x , 
the laws of physics are the same as in a Lorentz frame of reference momentarily 
at rest with respect to this coordinate patch. This is how special relativity 
handles accelerations and it works (see Chapter l). In particular, a real 
short measuring rod has the same length in both systems and identical clocks at 
rest in each system record the same small interval of elapsed time. In tho 
Lorentz frame 

A small rod of specified manufacture (for example, of length 10® carbon atoms) 
has length (if at rest) 

(it| = (9.4.3) 

(i runs from 1 to 3). A clock at rest ticks off 

at| = (J§2 _ _ (9.4.4) 

Now 
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dx = ^ d? 




dx 


The quantity d?.is zero because the Lorentz frame is locally at rest 
•with respect to the arbitrary frame. Then 

d§. = -r— dx. 

1 Sx j 
J 

and the length of a measuring rod at rest is given by 

35. 3?, 

Notv 

31 3?o ai 3|„ 3?„ 


oC 


(9.4.5) 


’UV 


= T1 


aP 


g g _i _i _o _q 


Bx Bx 
la V 


Bx Bx 
\l V 


Bx Bx 

V 


and so 




. ^?n\2 

=■ “ I'&T) 


because B§^ /Bx^ vanishes, and 

Sjk = ^ 


(9.4.6) 


if 3 |^/Bx^ vanishes. This is not necessarily 


Clearly (d§.)2 = g dx. dx^ __ _ ^ 

the case, even though B?^ /vanishes. ¥e also have 


!!«!!• 

«ot 

and so obtain for the case where B§^/Bx^ vanishes 




=ok 


vr; 


Thus the proper length of an infinitesimally short measuring rod [3] is given 

i>y . g .g , 

d^2 = dx, dx,, I g - j (9.4.7) 

^00 


and clearly an interval of proper time as recorded by a clock of specified 
manufacture at rest in the arbitrary frame is given by 


dT^ 


: - g dx"^ 

^00 O 


(9.4.8) 


These relations should be general, because we originally defined the g^^ in 
terms of partial differential coefficients linking a local Lorentz frame in 
free fall with the specified frame: we did not specify the relative velocity 
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of the frames. We can get the same ansT^er in a different Tvay, [4]. If 
dx^ = 0 then clearly dT^ = -g^^ dx^ . For measurement of distance we employ 
a light pulse transmitted, reflected from a mirror and received at the origin, 
and time it with a real standard clock. The length between source and mirror 
is At, identical to the length of an identical measure in the comoving iner¬ 
tial frame. The proper length is thus At/2c , where At is the time elapsed 
on either standard clock between transmission of a light pulse and reception 
of the echo. In the arbitrary coordinates the propagation of light follows 
the relation 


g dx dx =0 
^1-LV \1 V 


g dx"^ + g. . dx. dx. + 2g . dx dx. =0 
^00 O 1 J ^01 o 1 


dx 


: = - g • ^ g . dx. - g g. . dx. dx. rr • (9.4.9) 

o g L^oi 1 V \^oi 1 ^oj j ^00 1 j/ I ^ 


On the outward journey dx^ is positive, on the return journey negative (or 
vice versa). The total coordinate time elapsed over this very short distance 
is thus 


Ax 


= - . dx. g . dx. - g g. . dx. dx. 

O g V ^01 1 ^oj J ^00 1 j 


The corresponding element of proper time recorded by the clock at the source 
is 


At 


v: 


/g . dx. 
" V ®oi 1 


g . dx. - g g. . dx. dx. 

OJ J ^00 1 J 


(9.4,10) 


and At = 2At (with c = l) and 


At^ = dx. dx. I g. . _ ^ ) (9.4.11) 

once more. It should now be clear that the measured local velocity of light 
is always the same number c (whether defined as 1 light-sec s”"^ or as 
3 X 10^° cm s""^) even if the coordinate velocity is not. However, if we time 
light over a distance such that the approximation of constant g is not 

|JiV 

valid, the measured average velocity of light will not be c . 


We will now examine some simple cases in which the arbitrary frame 
related through straight line motion to a specified inertial frame 
let 


X = 


X- vt 


Ti 


T = 


t - vx 

Vi - ^ 


vl-v" Vl-V 

From these transformations we compute g 


Y = y Z = z . 
and find of course g 


K IS 

. First 
(9.4.12) 
= 11 , 


M-v - - --- 

The coordinate time is the proper time and the space markers are uniformly 
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separated in a \vell defined proper length. The velocity of light is c(=l) 
in all directions and over any range, both in terms of the coordinates and as 
actually measured: the transformation is the Lorentz transformation charac¬ 
terised by 



Now suppose 

X=x-vt T = t Y=y Z = z (9.4.13) 


Again 



= V 


but this is the Galilean transformation which is abandoned in special relati¬ 
vity. In general relativity it is perfectly admissible. We at once find 


gQO -- (1-v ) , g^^ - g 22 - §33 1 1 §01 %0 (9.4.14) 


and all other components zero. The derivatives of the metric tensor vanish, 
so there are no inertial or gravitational forces. The values of the compo¬ 
nents do not depend on the coordinates and so the velocity of light is equal 
to c in any direction, measured over any range with a real clock and a real 
measuring rod. The velocity of light measured in terms of the coordinates 
does not depend on position, but it is not equal to c and depends on the 
direction. 


dx 


dx 1 

” I = 0 


®01 ~ ® 00®11 


1-v" 
± 1 - V 


± 1 + V 


(9.4,15) 


dy 

dx 

o 


dT^ = 0 


dz 

dT" 

o 



(9.4.16) 


The relationship between proper time and coordinate time at any point is 

dT = '/l - v^ dt (9.4.17) 

and A 

= I - - At^=Ay, Al = Az . (9.4.18) 

* Vi-v" y 

It is quite clear what is happening. In both this case and the previous case 
we are transforming between two inertial frames. In the first case we used 
the Lorentz transformation which links coordinates measured in standard proper 
units to coordinates in the second frame measured in standard proper units. 

In the second case the two reference frames are the same as in the first case, 
but in the second frame the coordinate clocks are all running at a faster rate 
than standard clocks so as to allow T =z t despite time dilation and the x 
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coordinates are chosen expanded so as to aIIo\^ X = x-vt despite the Lorentz 

contraction. A redefinition of coordinates in this frame takes us hack to the 

first case [ 5 ] . This example is helpful in understanding some of the features 
encountered with accelerated motion. 

9.5 An example of an inertial field 

Consider a space vehicle with its engines blasting so as to produce a 
constant acceleration g as measured by an accelerometer on board. The 
accelerometer is therefore accelerating at g with respect to the comoving 

inertial frame. We track the vehicle from a given inertial frame by using 

the four-velocity and four-acceleration. In the local instantaneously comov¬ 
ing inertial frame carrying coordinates the proper time coincides with 

proper time on the vehicle, and so 



( 9 . 5 . 1 ) 


and is the only non-zero component of the four-acceleration. Lorentz trans¬ 
formation of the four-acceleration gives in a specified inertial frame 



( 9 . 5 . 2 ) 


where v is the velocity of the comoving inertial frame in the fixed frame. 
The four-velocity in the comoving frame has one component 



so 


dT 



V 


( 9 . 5 . 3 ) 


Then 




= clT^ - dX^ 


SO 



whence 



g 


( 9 . 5 . 4 ) 



where 


V 


_ ^2 

“ dT 
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This is the equation of motion of the vehicle in the X frame, and is also the 
equation of motion of a particle acted on by a constant force, for example an 
electron in a constant electric field. 


It Mill he more convenient to note that since the magnitude 
acceleration is an invariant 


With 


Me find 



- ( 0 / = 




_ 1 

~ \ dT / 

d^X dX 

d^T Q 

dT2 

dT 


dT^ 


dT 

dT 


djT 

dT^ 


dT 


of the four- 


(9.5.5) 

(9.5.6) 


(9.5.7) 


Differentiate once, substitute and solve. The solutions are exponentials 
Mhich can be combined to form the hyperbolic functions. Since T T as 
gT -> 0 and X ^gT^ , the solutions are 

X = g"^ (coshgT- 1 ) , T = g“^sinhgT , v = tanhgT . (9.5.8) 


(results Mhich were already implied by the form of (9.5.6)). The quantity T 
is proper time as registered on a clock at the same place as the accelerometer 
reading g . We now equip the astronaut in the vehicle with a framework of 
rods and clocks reading a coordinate time = t(0), which is the proper time 
in the frame instantaneously comoving with the observer. In this comoving 
frame we have coordinates (and , §^) and freezing the picture instan¬ 

taneously 


X = 


Vl - 


(9.5.9) 


The astronaut has coordinates X(0) , T(0); §j^(0) = ® values of 

X, T corresponding to arbitrary values of are obtained from Eqs. (9.5.8) 

and ( 9 . 5 . 9 ): 

X - X(0) = 5^ coshg t( 0 ) = coshg 
T - T(0) = sinhg t( 0 ) = sinhg 


at the same instant of coordinate time, whence 
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X = (g-^ + coshg 5^ - g ^ 

T = (g“^ + sinhg 5^ . (9.5.10) 

All we have done is to use special relativity to give our astronaut a refer¬ 
ence frame coinciding with that of the comoving inertial frame with Lorentz 
coordinates: it is for this reason that we keep the labels . (if the 

acceleration is not constant establishing the equivalent frame is much more 
complicated.) We have from Eq. (9.5.10) 


dT2-CiX2 = (l + g|^)2d|2_d|2 . (9.5.11) 

The only term in different from the corresponding term in n ^ is 

g^o =- (1 +g • Thus proper time elapsed depends on position in this very 

natural coordinate frame, and the acceleration as measured at any given 
departs from the value g by a factor (l + g§^) . 


Let us work out accelerations as a function of position. To do this we find 

the equations of motion of a freely falling particle, using Eq. (9.2.11) and 

(9.2.14). The only non-zero derivative of the metric tensor g is 

^lav 


-g|— = - 2g (1 + g . (9.5.12) 

If we eliminate the proper time on the falling particle from Eq. (9.2.11) we 
obtain 



• d§, d§ d§, dP. 

V ^ -pO H 

vu d| d| d§ d? d§ 

0 0 0 0 0 


(9.5.13) 


The only non-zero components of the affine connection are 


pi 

■^00 



g(i + g?^) 


so that 


10 

d2? 


_ "r® _ i 
- ^01 - 2 


af g;„^ = g(i.gg- 


-^=-g(l + g?l) +2g(l+g|^)-l (9.5.14) 


\d§ J 
o 

is the equation of motion of a freely falling particle. Note particularly 
that in this inertial field the acceleration is independent of the transverse 
velocity. 


The acceleration recorded by an accelerometer at will correspond to 

(d?./d§ ) = 0 but the coordinate time interval d§ is not the local frame 

j- u O 

proper time interval dr except at = 0 . At fixed we have 
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dT^ = (l + g§^)^d5^ and so for fixed 5^ the acceleration recorded is 
hF P 




a?; 


2 \ dT 


) = 


(i + g?i)^ 


for 


31/ 


<i§! 


= - g(l + g §j^)' 


(9.5.15) 


= 0 . 


The space coordinates are uniform and isotropic when measured with real measur¬ 
ing rods: the reason why the acceleration differs from g for ^ 0 is 
that seen from a given inertial frame this coordinate system on the spacecraft 
is shrinking longitudinally as the spacecraft accelerates — a progressive 
Lorentz contraction. If the acceleration was g for all , as it could 
he for markers attached to separate rockets firing in an identical pat¬ 

tern, then a given length Awould increase with time as measured with stan¬ 
dard measuring rods. 


For small §i, g ? 

X ^ + (ig ?^) 

T = + (ig §„) 

This is obviously the equivalent of setting 

x= + 


(9.5.16) 


(9.5.17) 


T = ?o + V 

in the Lorentz transformation (neglecting v^) rather than using the Galilean 
transformation 

X = §1 + V §2 

T = I . (9.5.18) 

o 

It is for this reason that gpj.| vanishes: if we assumed (for small g a 


transformation 


then we would have 


=01 

X = ?! + ig 
T = § 


— ^ 1 Sr 




^01 


The proper acceleration, as measured by the acceleration in the ? frame of a 
test particle released from rest, is the same in both cases. 


The coordinate system constructed from special relativity has the advantage 
that the inertial force appears as the gradient of a time independent potential. 
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thus naturally having this feature in common \iith a force (iue to a permanent 
gravitational field. 


Let us confine ourselves to a region sufficiently close to an accelerometer 
reading g that we may set 


g 


00 


(1 + 2g . 


The coordinate velocity of light is thus given by 

(d|)2 _ (i + 2g de2 = 0 , •3|-=i + g§i 

where 5 is measured from the accelerometer. Because dj = (l+g§^) d§^ 
at a given , the round trip time over a very short path clearly gives a 
measured local velocity c = l everywhere. 


We can calculate the bending of a ray of light in the § coordinates using 
the coordinate velocity. If we consider a plane wave initially propagating 
at right angles to the acceleration, say along 5^ i then the arguments of 
section 5.1 yield 


1 Sc 
c 


and the angle 


111 

a5| 

- g A §2 


(9.5.19) 


after traversing A • 

This is in agreement with the result of section 1.2. Note that this is the 
angular deflection that would actually be measured locally because the space 
coordinates 5^ are uniformly spaced as measured by standard measuring rods 


From g^^ we may also compute the redshift. The quantity is a univer¬ 

sal coordinate time. Consider two standard clocks at rest with respect to 
the coordinates §, one at = 0 and the other at a particular value of . 
Corresponding intervals of proper time are given by 

= (1 + g §3^) 

dT(0) = . 

Let the proper frequency of an atom be v. Let At( 0) be the time between 
two successive pulses of such an atom at rest at 5=0. Because in these 
coordinates g^^ is independent of and the atom is at rest, these two 

pulses of radiation are spaced by = At( 0) at In this same interval 

of coordinate time the proper time interval at is At( 5^) = (l + g 5^) A5(0) 
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in T^^hich time (l + g §i) pulses are emitted by an identical atom at rest at . 

v(l,) 

^=l + g?, (9.5.20) 

T^hich again agrees with the results of section 1,2, Note that g can be 
measured as the acceleration at 5^ = 0 and is a distance that can be 

directly measured with a real measuring stick. 


This section has been concerned with the inertial field appearing in an accel¬ 
erated laboratory. In a permanent gravitational field, additional features 
are present because the spatial part of the metric tensor is no longer 6. . . 


9.6 The external spherically symmetric gravitational field 

The Einstein field equations relate the metric tensor g to the 
energy-momentum tensor and determine the quantities g^^ up to a coordi¬ 

nate transformation. The quantity on the left-hand side of Eq. (9.3.1) 
contains second derivatives of g corresponding to Q h , and bilinear 
products of first derivatives, corresponding to the energy-momentum tensor of 
the gravitational field itself. 3^ is thus the energy-momentum tensor of 
everything except gravity: solution of the Einstein field equations in a 
region where 3^^ = 0 corresponds to the solution of the Newtonian equation 
in empty space , 

=0 . 

The latter equation is trivial to solve for the case of spherical symmetry 

yielding -jj _ £ 

'r — r 


and we identify the constant K with - GM where M is the (spherically 
symmetric) mass at the origin. 


The corresponding solution of the Einstein field equations is obtained as fol¬ 
lows [6] . First, set 3i = 0 . Secondly, choose a coordinate system. If 

JJV 

the coordinate system is taken to be in the rest frame of the source, it seems 
natural to choose coordinates such that g^^ does not depend on time. We 

would also like time-orthogonal coordinates: that is, g^^ vanishes. For a 
spherically symmetric field it is attractive to leave the tangential parts of 
a small interval alone, and write 

dT^ = B(r) dt^ - A(r) dr^ - r^(d9^ + sin^G dcp^) (9.6.1) 

in spherical polar coordinates, or 

= A(r) , ggg = r2 , = r^sin^G , = _ B(r) . 

The effect of the gravitational field is embodied in g^^ and g^^ , 
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This choice of coordinates 

^ =0 provided that 

\1V ^ 


satisfies the Einstein field equations v/ith 


g 

=-l 

[i 

--J 

^00 



r J 1 

g 

= 1 

ii 

+ ” 

&rr 


1. 

r J 


(9.6.2) 


Since the gravitational field is given in the Newtonian limit by i 

constant K is identified with - 2MG , where M is the mass contained within 
r as r -♦ a>. This is the Schwarzschild exterior solution in standard coordi¬ 
nates 

dT^ z= (^1 - - ' ^^2GM “ “ r^sin^Q dcp^ . (9.6.3) 

^ ~ r 

Time is affected, radial lengths are affected but tangential lengths are not. 

In Chapter 6 we found that all short lengths were equally affected by a gravi¬ 
tational field. It is therefore not surprising to find that we can redefine 
the radial coordinate r so as to achieve this as a property of the 
Schwarzschild solution. Set 


when 






(9.6.4) 


. MG\ /, MG', 

dr = U- 2 ^ 


SO that 


dT^ = 


1 M 


1 + 


\ 2 


m 

2p 


dt^ 


- (dp^+p^d0^ + p^sin^S d(p^j (9.6,5) 


This expresses the Schwarzschild solution in isotropic coordinates. If the 

Schwarzschild metric is used in the standard form, the equations of motion 

resulting are closely analogous to the Newtonian equations of motion. If 

isotropic coordinates are used measurements over small distances with real 

rods are related to the coordinate intervals merely by an isotropic change of 

scale. Furthermore in the weak field limit we obtain precisely the tensor 

theory of gravity developed in the first part of this book from the equations 

h =8 nG3 
IJV 
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Both systems of coordinates are "widely used and neither is in any sense the 
more fundamental. There is a third system of coordinates also frequently 
encountered, the so-called harmonic system of coordinates Lll in "which 


1 

R 

1.^ 


^io 


= 0 


(9.6.6) 



To first order in the usually small quantity G^/R or GM/p, and 

in the isotropic coordinate system and the harmonic coordinate system agree. 

The forms of e also agree to second order. Consequently the t\vo systems 
^00 

are equivalent for the solar system measurements ve have considered and \tfe 
shall discuss the harmonic system no further. Hoivever, the standard and iso¬ 
tropic forms of the metric differ in first order in g^^ and g^ and in 
second order in g^^ . The physical content is nonetheless the same. We 
shall work out the predictions for the measurable quantities already discussed, 
namely the gravitational redshift, deflection of light by the Sun, radar echo 
delay and the precession of the perihelion of Mercury, using both coordinate 
systems. 


9.7 The gravitational redshift 

Consider first the gravitational redshift. An interval of proper time 
is related to an interval of coordinate time through 

dT = 'Z- g * dt ( 9 . 7 . 1 ) 

00 

for a clock at rest in the coordinate system. At two different levels in the 
gravitational potentials different intervals of proper time correspond to the 
same interval of coordinate time. Let two pulses of light be emitted from a 
source at a radial coordinate r^, an interval of coordinate time At apart. 
The elements of the metric tensor are independent of coordinate time so that 
the coordinate time separation at a radial coordinate r is still At . 


At = 


AT(r^) 


ATCr^) = V-g^^Cr^) At 


A'T(r2) l~i 




(9.7.2) 
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To first order in 
dard coordinates: 


GM/r , has the same form for either isotropic or stan- 

using the standard form 



(9.7.3) 


The frequency of signals received at r^ , measured with a real standard 

clock, is related to the frequency of the same signals emitted at r^ , mea¬ 
sured with a real standard clock, by 


or 



(9.7.4) 


The frequencies and are measurable quantities. The right-hand side 
of this equation however contains the coordinates r^ and r^ and a mass M . 
In order to predict the redshift we need to reduce the right-hand side to mea¬ 
sured quantities. Over small distances 



V 



Ar = r^ 


r 


1 • 


The quantity Ar is a coordinate distance. The radial distance AL 
r^ and r^ measured with a real measuring rod is related to Ar by 



Ar Ar 


between 


(9.7.5) 


if we only want the expression for the redshift to first order. We may there¬ 
fore replace the coordinate interval Ar by AL , measured with a real measur¬ 
ing stick. Finally, the acceleration of a freely falling particle is given by 


Eq. ( 9 . 2 . 11 ) and for negligible velocity we have 


dt'^ 


GM 

^2 


( 9 . 7 . 6 ) 

if we ignore corrections of 


The measured acceleration g is equal to d^r/dt^ 
order GM/r and higher powers, so the redshift recipe is, to first order in 
the gravitational potential. 


— — g AL 
V ^ 


( 9 . 7 . 7 ) 


an expression which contains only measured quantities. This frequency shift 
clearly disappears if the laboratory containing the source and the detector is 


in free fall instead of fixed in the coordinates. 
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9,8 Deflection of light by the Sun 

We vill calculate the deflection of light by setting dT^ = 0 for a 
pulse or vavefront moving at the velocity of light. The coordinate velocity 
of light is then given by 

g dx dx = 0 . ( 9 . 8 . 1 ) 

The local velocity of light, as measured by timing over an infinitesimal dis¬ 
tance, using standard clocks and rods, is always c . 


Consider a plane wave propagating at right angles to the gravitational field 
in a small laboratory. Label the radial coordinate by y and the tangential 
coordinate by x . Then the angle turned through in distance Ax is given by 


where u 


Ax 


1 ^ 
u By 

is the tangential coordinate velocity of light. 


dx ^ 


( 9 . 8 . 2 ) 


In isotropic coordinates the coordinate velocity of light in any direction is 


so 


' MG^ (. m\ 


-3 


2GM 


dx 


2GM 


Ax = 


2GM 


Ax 


( 9 . 8 . 3 ) 

( 9 . 8 . 4 ) 


Because the coordinates are isotropic, this angle preserves its value in real 
units. But we measure it with a telescope clamped to the far wall of the 
laboratory of dimension Ax . Now 


AL^= VI 


GMN^ 




) Ax 


( 9 . 8 . 5 ) 


So the walls of the laboratory are set at a small angle to the coordinate 
frame, just enough for the telescope to measure an angle 


^ AL^ (compare section 6.2) 

where AL is the measured distance across the laboratory, and the quantity 

X 

(GM/r^) = g is the measured acceleration of a slowly moving particle. (This 
result is again only to first order in GM/r.) The variation of 
relating the measurement system to the coordinates can only be ignored if the 
first derivative of GM/r can be ignored — in which case to this approximation 
there is no deflection. 


In standard coordinates the 
velocity is 


picture is a little different. 



The tangential 

( 9 . 8 . 6 ) 


G.R.—F 
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dx - J.2 


(9.8.7) 


A tangential length AL = Ax because gp^ and g have their flat space 

X U 0 CpCp 

values. A length related to Ay by 


AL = 

y 


Ay 


^/l_ 2^ 


dy GM .. 


if we only want an answer to first order. The side walls of the laboratory 
are parallel to the y coordinate in this case and the telescope measures an 
angle g AL^ once more. 


Thus if we calculate the result of a measurement of the deflection of light in 
a small laboratory at rest with respect to the source of mass M, we get an 
answer independent of which set of coordinates we used, and the answer satis¬ 
fies the principle of equivalence. 


The coordinate deflection of light starting its journey very far from the 

deflecting mass and being received very far from the deflecting mass can be 

calculated by integrating the deflection over an (approximately) straight path, 

as in section 1.4. In isotropic coordinates the velocity is, to first order, 

1-2—. It is clear from comparison with section 5.1 that the deflection in 

4GM 

isotropic coordinates is then given by a = where b is the impact para¬ 

meter, approximately equal to the distance of closest approach. 


In standard coordinates the coordinate velocity of light is direction dependent 
and the treatment is a little more complicated. We can either solve the equa¬ 
tions of motion (9.2.15) or simply use the variation of the coordinate velocity 
of light. In the latter approach we take the condition for light 


dt^ = 0 = ( 1- —' dt^ - 


dr^ 


r / 


2GM 


■(d0^ +sin^e dcp 2 ^ (9.8.8) 


and set 


da _ du ^ f dx\ 
dx “ u dy "dy Vdt/ 


dx 

as before. Here x is the distance along the path, with ‘^^1. We choose 
0 =z 90°, so d0 = 0 , and have 


0 = clt 2 ! 1 _ 


A j 2 .a 2 

- —; - dcp^ 


dr^ 


1- 


2GM 






GRAVITATION AND THE GEOMETRY OF SPACETIME 


145 







df^ 


(9.8.9) 


\dx/ 

(see Fig.9. 8 . 1 ) \vith 

r~Z 2 dr X y dcp y 

= Vx2 + y , dj = 7 , coscp=^ , -55 = —. 


We no^/i have 

2 ( 2 GMx 2 '., / 2GM\ .,2 

0 =.dx2 {,1+-^; - V^-—; 

( 9 . 8 . 10 ) 

(to first order) and the velocity of 
light along its nearly straight trajectory is 

dx , GM/, x2' 

dt r V t-2 / 


Fig.9.8.1 Coordinates employed in 
calculating the deflection of light 
by the Sun, Eq. (9.8.9) 


( 9 . 8 . 11 ) 


GM 


When x=0 this is the tangential velocity, 1 -, and ’when x^^r it is the 
radial velocity 1-—^. 

Then 00 

J (9.8.12) 

— 00 ^ 

once more. The change in angle with b relative to a fixed direction (for 
example a source not significantly affected) is thus given in either standard 
or isotropic coordinates by |j" ^ to first order in -jj • The angular change 
is identical to that actually measured in isotropic coordinates: in standard 
coordinates the difference in the coordinate angle and the measured angle is 
0 where R is the radius of the Earth's orbit: such corrections, of the 

order of one part in 10®, can be ignored in this problem. 


9.9 Radar echo delay 

The excess time delay encountered in radar echo experiments is more 
instructive. The time for a round trip between the Earth and Mercury is 


r 

e 

In isotropic coordinates we set 


( 9 . 9 . 1 ) 


dx 4 2GM 

dt “ ^ r 


as in section 5.2 and obtain 


r* 4i* p ^Tn 1 

tf = 2(x^ + xj + AGMq to [ -p—J 


( 9 , 9 . 2 ) 


where h is defined in Pig.5.2.1 and is approximately the impact parameter. 
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In standard coordinates, using Eq. (9.7.11) we have 

GM^ 


T- V + fa] 


4r r 




4GM, 




(9.9.3) 


In these expressions we have set Xg , equal to r^ and rj^ in the loga^^ 

rithm ic terms . We can express x^ and in the zero order term through 

x^ = /\/r^-b^ but it is better to express it in terms of the distance of clo¬ 
sest approach to the Sun, r^ . Because the angle turned through in going from 
x=0 to x»rQ is approximately 2GM/rQ , we have 


where g is non-zero because r^ ^ rj^ 


o 

Then 


= Vr^ - = /r^ - fr + (b - r ) Y 

e e \ o ^ o'/ 


and 

and 


X + X 

e m 


“Vr^-r^ + Vr^-r^ + 4 


GM 


t = 2( /r^-r^' +Jr^-r^') + 4GM ^ii(-%-S) + 8 GMq 

I Vv e o \ m o / 0 \ ^2 / O 

o 

"S = " 4GM<,te(l^) - 4GM^ 


(9.9.4) 

(9.9.5) 


These two answers are different, but are at once reconciled by remembering that 
isotropic coordinates are constructed from standard coordinates by the trans¬ 
formation 

/ Mn \ ^ 

(9,9,6) 


( . MG ■'i 

V ^ ^ 2r^) 

leaving everything else alone. 


r^ + GM 


The quantities t , r^ , r^ and r^ are not directly measurable: rather they 
are convenient parameters related to directly measurable quantities. The time 
interval between the transmission anu reception of a radar pulse is measured 
with an earth-based atomic clock which is running at a different rate from 
coordinate time. There are two ingredients in the transformation. Because 
the clock is in the Sun's potential (and indeed in the potential of the Earth) 
we pick up a factor -^1- (GM^r^) and because of the o rbital vel ocity of the 

Earth relative to the coordinates we pick up a factor ^/l~- {v^/c^) , where 

,.2 GM^ 


The measured proper time interval tp is thus related to the coordinate time 
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interval by 

(to first order in . 


t 

P 




(9.9.7) 


The total travel time is ^1.2 X 10^ secs for the Earth-Mercury-Earth trip 

and the correction factor is ^^1.5 X 10"® , introducing a difference between 

t and t 20 ps , which is not negligible since the total excess delay due to 
P ^ 

gravitation is ^ 200 ps . 


The parameters r^ and r^ are not directly measured either: they must be 

related to something which can be measured. For example, measurement of the 

time delay when Mercury is at inferior conjunction (between the Earth and the 

Sun) and at extreme elongation provides two times in terms of which r^ and 

r can be evaluated, using of course the equation of motion of light in the 
m 

Schwarzschild field. More generally, observations of the time delay can be 
made over a period of one or more years and the equations of motion of light 
tested against these results, with r^ and r^ as parameters to be fitted 
along with many others. Alternatively we may use the directly measurable 

orbital periods and the equations of motion of the planets (expressed in 
Schwarzschild coordinates) to determine r^ and r^ . In isotropic coordi¬ 
nates the equations of motion are those we used for calculating the precession 
of the perihelion of Mercury in Chapter 7 so that the period of revolution 


T 


e ■ 



( 9 . 9 . 8 ) 


where a = 2 . 
when 


The substitution R = Rq-GM takes us to standard coordinates 


T 


e " 



( 9 . 8 . 9 ) 


which is the Newtonian expression. ¥e thus see (indirectly) that the coordi¬ 
nate time period is related to the coordinate radius of the orbit by the 
expression obtaining in Newtonian gravitation, if we use standard coordinates 
rather than isotropic. Again it must be remembered that Tq is coordinate 
time and the period measured with an earth-based atomic clock must be corrected 
to give Tq . The quantity GMq represents a distance of 1.5km and a time of 
travel at the velocity of light of 5 ps . 


It should be clear that the problem of testing the predictions of gravitation 
for radar echo delay is perfectly well defined in terms of measurable quanti¬ 
ties, but of great technical (rather than conceptual) complexity. Interesting 
detailed discussions exist [8] : we only remark that the predictions of 
Einstein’s theory are currently verified at about the 3^^ level [9] . 


148 


GRAVITATION AND RELATIVITY 


9.10 The precession of planetary perihelia 

The precession of planetary perihelia is more complicated to calculate, 
since Me need the equation of motion to second order in small quantities. How¬ 
ever, comparison of the equations of motion and the Schwarzschild metric tensor 
in isotropic coordinates with the weak field tensor equations of Chapters 6 and 
7 (with the non-linear parameter a = 2) shows that we have already solved the 
problem in isotropic coordinates : the advance is 6 tt radians per revolution. 

In standard coordinates we write 


so that 


dT^ = (^1 - dt^ - r^ sin^G dcp^ - r^ d9^ 




1 

CM 

2GMy^ 

= - g dx dx 




r / 

^liV \1 V 



2GM', 


2 • 2a ”7 


- 

-—J 

®CfCp 

= r‘^sin‘^0 


2 



2GMV1 

See = 

: r 


Srr 

= V-—) 


(9.10.1) 


(9.10.2) 


It should be clear that since is a potential which couples to the square 

of velocity, we need only to first order in GM/r , but g^^ to second 

order. In contrast with isotropic coordinates however, there is no nonlinear 
term in g^^ in standard coordinates. ¥e shall obtain the equations of 
motion in standard coordinates and then evaluate the precession rate by the 
approximate methods of Chapter 7. We work in polar coordinates, both because 
polar coordinates are appropriate to the problem and because g^^ is diagonal 
in polar coordinates : the standard form is not diagonal when expressed in 
rectangular coordinates. 


In principle we must compute the affine fields and then solve 

dx.. dx 


d^x 

X pX 
^ Vv 


dT 


_V 

dT 


(9.2.11) (9.10.3) 


but we can employ some short cuts. We choose Q = 2 ^ define the plane of 
the orbit and so are not interested in the equation for 6 . Further, if we 
find the quantities dt/dT and dcp/dT we can then use the expression for 
proper time to give us the equation of motion for r . We therefore only need 


to calculate the nonvanishing members of 
Now 


and 

lav 


pCp 

M-v* 


Xji ^ ^ I bx, '** 3x 
X ^ji 




dx J 


(9.2.14) ( 9 . 10 . 4 ) 
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and for diagonal the elements of the inverse matrix g^^ are jnst the 

reciprocals of the elements of have 



d^t /' t -pt|''dt'', /'dr'', „ 

77 + l^rt + ^tr) ' 

(9.10.5) 

all other 

terms vanishing, with 



t r* 13^ 1 GM 

■^rt tr 1 2GM ^.2 

(9.10.6) 

and with 

6 constant, d9/dT zero , we have 



^ + I'r'p + r'P 'l ■(—j = 0 

jjq-2 \ rep epr/ vdT/ sdT/^ ’ 

(9.10.7) 

all other 

terms vanishing, with 

„cp _ pCP 1 ^ „ _ i . 

rep epr Zg^ dr ®CfCP r 

(9.10.8) 

Then 



dft 2^ 



^ dr 

(9.10.9) 


dt 2GM dT ~ 

dT ^ " r 


■whence 

In ^ + In = constant 

(9.10.10) 

and 



^ 2GM\ 

dT = K 

(9.10.11) 

where the 

value of K is determined by the velocity of the particle. 

Similarly 


d^cp 



dT^ 2 dr „ 

dCp r dT " 

(9.10.12) 


dT 


whence 

dCp 

tn — + 2 tn r = constant 
dT 

(9.10.13) 

or 

r^ = constant = H . 

dT 

(9.10.14) 


This equation clearly expresses conservation of angular momentum. With 



■we can express the radial equation of motion as a function of either t or t : 

SGMVl 

. {,.m) . (i/ . 
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For the moment \ie \^?ork with the latter form, which we write as 

^2 , On>yf\— 3 T_2 


where 


t.2 1 f^rY (■ 2GMV^ Y 

^ - Vm; [^-—j - ^ 

" r 


Differentiate (9.10.16) with respect to time and obtain 

1 _ ^ ^ /■ 2GMV^ 

wy ^2 Vdty “ ^ dt^ \^““ 

r 


0 


1 ( Or\ ^ , 2GMV 


We drop the last term and write 


d^r GM /. 2GM' 




dt^ 


\^-TJ • • 


For a circular orbit 


d^r 

dt^ 


= 0 and so 


R 


hf _ GM I 2GM\-^ 
h3 - r2 V - H 


R'" R 

From Eqs. (9.10.17) and (9.10.20), 

^ _ fW 

dt ~ "J 

and the rotational period in standard coordinates is 


cp 


GM 


which is the same expression obtaining in Newtonian theory (see Eq. 
We now set r = p + R and obtain 

£f£ _ _ (1 


dt2 


^ (1 6 GM \ 

r3 - E ; 


and the radial period Tj, is given by 


Tjj = 2 n ,/ 7 ^ (1 


3GM\ 


GM ^ B 


The advance of the perihelion each revolution is then 

-T 

IS 
r 

cp 


o „ i Ji 1 1 « „ GM 

2 n - Ij - 6 tt JJ 


(9.10.16) 

( 9 . 10 . 17 ) 

) . (9.10.18) 

( 9 . 10 . 19 ) 

( 9 . 10 . 20 ) 

( 9 . 10 . 21 ) 
( 9 . 9 . 9 )). 

( 9 . 10 . 22 ) 

(9.10.23) 


once more. 
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CHAPTER 10 

BLACK HOLES 


10.1 strong gravitational fields 

In the development of the relativistic field theory of gravity, \^7e \ient 
only as far as first order in the gravitational potential. We tackled the 
problem of the precession of planetary perihelia, which involved second order 
terms, by supposing gravitation to obey the principle of strong equivalence, 
but without demonstrating that an internally consistent theory results. 

The Einstein field equations, in principle, are exact and thus provide ua with 
the means to calculate the effect of such gravitational fields of any strength. 
We must recognise however that all tests so far made have been sensitive to 
terms of no more than second order in GM/rc^, this quantity being no greater 
than 10“®. In this chapter we are concerned with strong gravitational fields. 



We are already aware of astronomical objects in the vicinity of which such 
strong fields exist, although we have not as yet been able to study physical 
processes so close in. These objects are pulsars, so called because they 
emit pulses of radio waves (and in some cases visible light) with extreme 
regularity, the pulse periods lying in the range '^SOms to 1 sec, [l] . The 
extreme regularity argues for a clock of stellar mass, while the short pulse 
period requires the dimensions of the emitting region to be less than about 
CT, 1500km for T = 50ms . These objects are believed to be neutron stars, 
rotating at the pulse frequency and producing the pulses by the interaction 
of a magnetic dipole field with the surrounding plasma, [2] . Although the 
mechanism of formation of such neutron stars is not yet elucidated, it seems 
clear that they result from the gravitational collapse of a burnt out stellar 
core. When the core of a star has exhausted its nuclear fuel, thermal pres¬ 
sure cannot support it and if its mass exceeds -^1.5M^the pressure exerted by 
the Fermi sea of electrons cannot support it against gravity either, so that 
collapse is inevitable, squeezing the electrons into protons to form an immense 
nucleus consisting almost entirely of neutrons. The pulsar NP 0532, in the 
heart of the Crab nebula, itself the remnant of the supernova explosion of 
1054AD,is slowing down and the rate of decrease of rotational energy, assuming 


153 



154 


GRAVITATION AND RELATIVITY 


the pulsar to he indeed a neutron star, almost exactly matches the rate at 
Tvhich energy is radiated from the nebula. 

At the surface of an object of mass and nuclear density, 3 X lO^^gmcm"^, 

the gravitational potential is 

the surface radius being '^lOkm. 


GM 
_2 


0.1 


In this chapter we shall only examine the properties of strong gravitational 
fields described by the Schwarzschild solution of the field equations. These 
fields are spherically symmetric and so correspond to a non-rotating source. 
The fields of rotating sources are of course of much greater astrophysical 
interest, but we are concerned here only with explaining what is meant by a 
hole and why it is black. 


10 .2 The propagation of light in strong fields 

Consider the expression for the proper time interval in the Schwarzschild 
field, using standard coordinates : 

cIt 2 = dt^-sin^e dcp2-(J02 _ (9.5.3) (10. 2.1) 


The radial velocity of light is given in terms of the coordinates by setting 
dT^ = 0 (and dcp = dS = o) 


dt 


dT2 = 0 



( 10 . 2 . 2 ) 


The Schwarzschild field of course applies to the empty space outside a spheric¬ 
ally symmetric source. If such a source is concentrated within a coordinate 
radius < 2 GM/c^, then there exists a radius, the Schwarzschild radius, 

B 

s c2 

at which the radial coordinate velocity of light is zero. An observer at 
infinite distance from the source has a clock running at the coordinate time 
rate: he ’sees’ light passing the r markers at ever increasing intervals of 
his time, stopping altogether as r R^ . ¥e must of course be careful about 
the interpretation of this result because we have not examined the signifi¬ 
cance of the radial coordinate. 

The proper length of an infinitesimal measuring rod, oriented along the 
radius vector, at rest at radius r, is 
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and so 


dr = dl 


I m 

J ^ - -r. 


V r 

As r -♦ Rg 1110 r markers are getting infinitely closely spaced. 

We may evaluate the time taken for light to get out from radius r by integrat¬ 
ing (10.2.2) r^ .r,-2GM, 

^ + 2GMte(~ _ 2 gm7 (10.2.3) 


1 - 


2GM 


1 ■ r 

and as r, this time co for any . 

1 s 

If therefore we can use the Schwarzschild solution as r Eg the conclusion 
is inescapable: light cannot get out past r = Es - and so neither can any¬ 
thing else capable of carrying a signal. 

This result might appear to be associated with the singularity in g^^ • We 
see however that the rate at which a standard clock at rest runs is 


dT 


^F- 


2eM 


dt 


and so from any distance greater than Rg clocks appear to run infinitely 
slowly at r=Rg and light is infinitely redshifted. 

Let us examine these effects using isotropic coordinates, in which 


dT^ = 


'. MG V , 

^ 1 dt^ - (l('3r^ +r^ d02 + r^ sin 


. MG 


20 dcp^^ . 

(9.6.5) (10.2.4) 


The spatial piece of the metric tensor has no singularity in isotropic coordi¬ 
nates. However, the coordinate velocity of light goes to zero as r -*■ Rj, 
where 


E-r 


GM 

2c^ 


and the clocks stop at the same radius. In isotropic coordinates 

,2 


dL 


(i 


dr 


which behaves entirely sensibly except at r = 0. The time taken for light 
travelling radially is 


t = J dr 


(l+^J 

= (r -r.) - 2GMLn — 
GM 2 V 

2 r 


+ (f)' (f-f) " (10.2.5) 


2r^-GM. 


which becomes infinite as • 
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From the point of view of an observer located far from the source of the gravi¬ 
tational field, light emitted from a radius r is progressively redshifted and 
takes progressively longer to get out as r tends to the critical radius. For 
r less than or equal to the critical radius, a light signal cannot get out 
and a source of gravitation lying wholly within the critical radius is enve¬ 
loped in an event horizon making communication with the outside world impossi¬ 
ble. We may calculate how far away such an event horizon is from an external 
observer. 


d.t 


dr 



in standard coordinates, where 61 is the length of a real measuring 
stretched between coordinate markers labelled r and r+dr. Then 


I = 


r 


dr 




- 2r GM + 2GM^ 



—v-,r 


2 

1 


rod 


( 10 . 2 . 6 ) 


As r^ R_ and r^ » GM 

1 o 2 

I 


+ GM 



which is finite for finite r 


2 • 

I - 


In isotropic coordinates we find as r^ ->R^ 

II 

"2 - ^ (5f) 


the difference merely reflecting the relation 


Ti = rg + GM for r » GM , 

We are not of course suggesting that a plumb line be lowered into such fields 
to make the measurement. 


The existence of such one way event horizons is not unique to general relati¬ 
vity. Consider a rocket with accelerometer reading g . In an inertial frame 
its equation of motion is 


= gt 


(9.5.4) 


(10.2.7) 


and hence 


gt dt 


1 ■ : = i r/l + g2t2 

J / g L ^ 

“ N/l + g2t2 


1 


( 10 . 2 . 8 ) 


A light pulse emitted at time t' from x = 0 is at 3 ^ = t-t' at time t . 

11 f ^ “ light can never catch up the rocket and so signals from space con¬ 
trol cannot be received by the astronauts — space control lies behind an event 
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horizon. The astronauts can nonetheless signal back to base and life goes on 
as usual in both space control and in the space vehicle. The signals start 
coming in again, of course, if the engines are switched off. 

In the case of an event horizon surrounding a gravitating mass, there is no 
way of switching off the field producing the horizon. However, if the 
Schwarzschild metric and the equations of motion are correct even for r less 
than or equal to the critical radius, life goes on as usual in a space vehicle 
crossing this critical radius: the local physics is identical to that in a 
laboratory far removed from the source of the field, provided that the tidal 
forces may be neglected. 


10.3 P article motion in the field of a black hole [3] 

Let us now consider the equation of motion of a particle falling inwards 
radially. Such a particle has zero angular momentum and so we can write Eq. 


( 9 . 10 . 15 ) in the form 



(10.3.1) 


where t is the proper time on the trajectory, recorded by a clock on the fall¬ 
ing particle. The quantity ^ clearly represents the square of the 

(not the coordinate velocity at infinity) so 


velocity as r 


(■ 

(compare the Newtonian equation 


dT/ 




■) 


and 


dT 


dr 



(10.3.2) 


The proper time elapsing between r^ and r^ is thus 



(10.3.3) 


This is finite for all r^, even for r^= 0. A freely falling particle thus 
passes through the critical radius after a finite time and indeed arrives at 
the singularity at the origin, if accessible, after a finite proper time. The 
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velocity dr/dT increases continually in accord with the Newtonian formula. 

We must however emphasise that dr/dT is the rate at which radial coordinate 
markers are passed measured with an on hoard chronometer. If instead of tak¬ 
ing shipboard proper time as the time parameter we take coordinate time, which 
is also the proper time of an observer at infinity, we must use Eq. (9.10.16) 
and write 



= (l-W [l-K" (10-3-4) 

where 1 - represents the coordinate velocity squared at infinity, which is 
also the measured value at infinity. The velocity 0 as r ->2GM, as 

we would have expected since even the coordinate velocity of light goes to 
zero at the critical radius. We may evaluate the time taken to reach the 
critical radius: 


dr 


_L_ r 


dr 


(10.3,6) 


Consider the two limits : 

(i) 1 - 1 , the test particle having the velocity of light. Then 

/rp- 2GM . 

t = r^-r^ + 2GM to (10.3.6) 

which tends to infinity as r^ 2GM , 

(ii) In the limit 1 - 0 we have 


t = 


4 


M i 

3 r 2 


2GM 


+ 4GM J + (2Gm)^ In ^ (10.3.7) 

'Vr + V2GM'^J 1 


which again tends to infinity as r^ 2GM. 


Thus from the point of view of an outside observer a particle approaching the 
critical radius slows down and takes an infinite time to reach it. From the 
point of view of an observer falling with the particle the critical radius is 
passed after a finite time and life goes on as usual. From the point of view 
of an outside observer, then, anything which passes the critical radius, using 
the proper time of the falling object as the time parameter, is effectively 
lost for ever and we shall now show that this is the inexorable fate of a large 
class of falling objects. It is most convenient to use for this study the 
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Eq. ( 9 . 10 . 15 ): we stick to the standard form of the Schwarzschild solution 
because of the simplicity of the equations of motion in these coordinates and 
because of the close analogy with the familiar equations of Newtonian theory. 

with 


2 ^ 

dT 


= H 


Differentiating with respect to T we obtain 


d^r 

dT^ 




m 

2 




( 10 . 3 . 9 ) 


Equation (10.3.8) is the analogue of the Newtonian equation for conservation 
of energy and Eq. (10.3.9) the analogue of the Newtonian equation for the 
radial acceleration. We interpret Eq. (10.3.8) in the following way. As 
r 00 'we find / 

where f dt\^ _1_ 

VdT/ 1^2 


i _ 1 

\dT/ Tr2 


Since 
where m 


Pu = “olif 


is the proper mass, we identify the quantity 1/K^ with the square 
of the total energy at infinity for unit proper mass, E^^. Then 


(10.3,10) 


In the Newtonian limit we replace t by t , and replacing by (1+T^) find 


= 


A 2 


2T - ^ + 


2GM 


where T is the kinetic energy for unit mass and we have dropped those terms 
00 

which are small in the Newtonian limit. 


From Eq. (10.3.9) it is clear that we may regard the quantity 


^ j u£i 

2 r 2 V - r ; " r 

as the effective potential determining r as a function of t . However, in 
relativistic problems we work with total energy more often than kinetic energy 
and it is convenient to write 

= 1 

j 2 \ f (10.3.11) 

^ • J I 1 +—I 


2 / 


2GM\ 


GM 


where 


(■ 
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is the total energy for unit mass at infinity (and need not be greater 
than unity) and H is the angular momentum for unit mass. 


This potential fimction V^(r) has a long range attractive piece varying as 
r“^, a shorter range repulsive piece varying as r”^ and an even shorter range 
attractive piece varying as r"^ . At very large r , V^(r)'^l. As r decrea¬ 

ses it will go below 1 and then for a sufficiently large value of H reach a 
minimum and increase with decreasing r . This increase is just the centrifu¬ 
gal repulsion, and these are precisely the characteristics encountered in the 
Newtonian problem. However, at very small r a centrifugal attraction prevents 
V^(r) from increasing to an infinite value at r = 0, produces a maximum 
in V^(r) and then digs a hole in the middle. ¥e may note that a centrifugal 
attraction is present even in the equations of motion we obtained in the weak 
field case (Eqs. (5.4.14); (7.3.2)): we were forced to introduce such a term 
to account for the deflection of light by the Sun. 


The maxima and minima of the potential function are given by 


9VltEl,o or !!i(i.22!) « 


The trivial solution of this equation is of course r = 
tions are _ 

2_J_ ± / 1 .. _ -i-' . 

r 3GM 

There is neither a maximum nor a minimum if 


= 0 . (10.3.12) 

CO. The other two solu- 


(10.3.13) 



1 

9G^m2 


H < 2/3 GM 


At this value the maximum and minimum coalesce into a point of inflexion at 
r = 6GM which represents the last possible stable circular orbit. Thus for 
any value of the energy at infinity there is no repulsion to stop a test parti¬ 
cle with H < 2/3GM from falling into r= 2GM and hence being captured by 
the source of the gravitational field. In the Newtonian case of course any 

non-zero angular momentum is sufficient to prevent a test particle falling in 
to r = 0 . 


For a 
occurs 


specified value of H> 2/3 GM the maximum of the centrifugal barrier 
at r : 'i 


max 


1 


r 

max 


GM 




\2 


(10.3.14) 


If the total energy is sufficient for the test particle to cross the barrier, 
then it will be swallowed up. The condition for capture is 
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> V2(r ) . (10.3.15) 

00 max' 

For a particle coming in from infinity we can express the angular momentum for 
unit mass in terms of an impact parameter and the energy at infinity. Far from 
the source, motion is approximately straight line, and so 


r cos cp = h , r sin cp = x 

where cp is the azimuthal angle measured from h , whence 


dcp 

dT 




dT 


so that 

As 00 (corresponding to 

condition (10.3,15) 


h2 = (e2 _ 1) . (10.3.16) 

V 00 

a particle moving with the velocity of light) the 


= V" 


(r ) 
max' 


becomes 


With r = 3GM we have 
max 


b^ (\ 

T /" • 

""max 

rrb^ = 27 tt(GM)2 


(10.3.17) 


and this is the capture cross-section of an object of mass M, sufficiently 
compact to be enveloped by an event horizon, for relativistic particles, 
is also clear that there is no stable orbit for relativistic particles, but 
that an unstable circular orbit exists at 


For non-relativistic particles starting at infinity 

.,2 


= b" 






of course. The energy condition is obtained by setting E^-1~0 when we 


have 


Set 


2GM 

- + 


max 


max 

g 

GM 


2GM 


max 


(10.3.18) 


and solve for a, obtaining 

1 

r 

max 

Compare with the condition determining r^^^ from the maximum of the poten¬ 
tial (10.3.14) 
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and so find 
Then 




r 6GM ‘v 

max 


VH / J 


H = 4GM = V h . 

03 


(10.3.19) 


An object sufficiently condensed so as to lie wholly within its critical 
(Schwarzschild) radius thus exhibits the following features. There exists no 
stable circular orbit inside r=6GM, and any test particle endowed with insuf¬ 
ficient angular momentum irretrievably falls into the critical radius, taking 
infinite time (as seen by an external observer) to get there. An external 
observer cannot meaningfully ask what happens after this radius is passed. It 
is in this sense that such a collapsed object constitutes a hole in space. 
Light cannot escape through the critical radius and as this radius is 
approached light becomes infinitely redshifted. The hole is black. 


10.4 The search for black holes 

It may be that black holes were formed in the early stages of the evolu¬ 
tion of the universe, and it may be that very massive black holes have been 
formed as the result of the implosion of star clusters or the cores of gala¬ 
xies. ¥e shall however limit ourselves to a brief discussion of the way in 
which black holes formed by the collapse of stars might manifest themselves [4]. 


A young star consists mostly of hydrogen. In the early stages of star forma¬ 
tion a cloud of interstellar gas and dust collapses, heats up and at the same 
time radiates away the surplus energy released in contraction. When the 
interior becomes hot enough for the nuclear reactions which transmute hydrogen 
to helium to proceed, despite the inhibiting effect of the Coulomb repulsion, 
the star enters a quasi-equilibrium phase. In this phase the energy radiated 
from the hot surface is balanced by the nuclear energy sources in the core. 
Once the hydrogen in the core is exhausted, collapse of the core starts which 
is stopped when helium is burned to carbon and oxygen. Most of the available 
nuclear energy has been consumed in the formation of helium and so a massive 
core will evolve quickly towards iron, after which no further nuclear energy 
is available. If this core is sufficiently dense that the electrons are 
largely relativistic, the pressure exerted by this Fermi sea of electrons is 
insufficient to stop further collapse. The critical mass for this is known 
as the Chandrasekhar limit and is about l.SM^. 
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If the degenerate core never rises above this limit, the electrons remain non- 
relativistic, the Fermi pressure prevents collapse and the star cools through 
a T^hite dwarf phase in which the kinetic energy of the nuclei leaks away. If 
the core is above the Chandrasekhar limit, collapse continues in an luistable 
situation, the zero point energy of the electrons is raised to the level at 
which they are removed through the reaction 

p + e~ n + V 

and collapse only stops when the neutron pressure balances the gravitational 
pressure. The density is now nuclear density. The minimum mass for a neutron 
star is s^O.lM^. (it is amusing but rather fortuitous that this result may 
be obtained merely by augmenting the semi-empirical formula for nuclear masses 
by a gravitational energy term.) 

If Me take the density of such a neutron star to be 3 X lO^'^gmcm the radius 
for IMq is 10km and ^ 0.1. Keeping the density constant and increasing 

the mass, ^ 1 is reached at a mass of f«4MQ. 

These calculations only give a very rough idea of the mass for which a neutron 
star surrounds itself with an event horizon. It is clear that in order to 
make a proper calculation it is necessary to put in a realistic equation of 
state for the neutronic matter and calculate the equilibrium conditions using 
general relativity instead of Ne^^rtonian gravitation [5]. A neutron star of 
mass less than ^O.IMq will expand to become a idiite dwarf: a neutron star 
of mass greater than a few will itself be unstable against gravitational 
collapse. We expect therefore the collapse of a stellar core with mass ex¬ 
ceeding a few to lead to a black hole. 

In searching for a black hole representing the end point of the evolution of a 
star we should therefore look for ah object of several solar masses or more 
'which is invisible and manifests itself only through its long range gravita¬ 
tional field. If we can find direct evidence of small size (remembering that 
GM^ 1.5km) and strong gravitational field, so much the better. It seems 
hopeless to search for isolated black holes, but some 859^ of all stars are mem¬ 
bers of multiple systems [6]. The search has therefore concentrated on binary 
systems with one apparently normal member. Observation of the characteristics 
of the orbit of the primary can yield the mass of the unseen secondary. An 
example is the peculiar eclipsing binary e Aurigae. The primary has an orbi¬ 
tal velocity of 14km s"^ (obtained from Doppler shifts) and the period of the 
system is 27.1 years (from both Doppler shifts and the eclipses). The velocity 
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of the primary andthe period give the mass of the invisible secondary in terms 
of the primary mass. The latter may be estimated from the visual appearance: 
it seems that the visible star is rather far advanced in its evolution and has 
a mass in the range implying a secondary mass in the range 

, which is certainly far too massive for a neutron star. However, 
the eclipse lasts for about 400 days and if the secondary is compact it must 
be surrounded by a disc of fairly opaque muck which actually does the eclips¬ 
ing. While such a model explains the light curve of the eclipses, we cannot 
conclude that e Aurigae contains a good black hole candidate: it is possible 
that the secondary is simply a bit too faint to be visible [?]. 

The best candidate to date is the X-ray source Cygnus X-1. The visible star 
(HDE 226868) has all the hallmarks of a supergiant with mass 20-30 and from 
the Doppler shifts the velocity of the primary is « 75km s"^, [8] and the 
period is 5.6 days. The mass of the secondary is inferred to be ^5M^. In 
this system there is evidence from the properties of the X-ray source that the 
secondary is compact [o]. First, there are strong irregular fluctuations with 
a timescale of '^O.l sec. and irregular flickering on a millisecond time scale 
[10] . The X- rays are therefore produced from one or more regions only a few 
hundred kilometres across. Secondly, most of the X-ray energy radiated lies 
in the range 10-100KeV . The X-rays are thought to result from the distended 
envelope of the super'giant streaming across the equilibrium point between the 
two gravitational fields and falling in towards the secondary to form a turbu¬ 
lent accretion disc, heating up and radiating [q]. If m gms”*^ fall in to 
radius r before getting hot, then very roughly 

(10,4.1) 

where M is the mass of the secondary, the X-ray luminosity and T the 

temperature of the emitting region. The quantity o is Stefan’s constant. 

The X-ray luminosity is obtained from the X-ray intensity at the receiver and 

estimates of the distance of Cygnus X-1. The distance may be obtained from 

the brightness of the primary and the characteristics of the spectrum. This 

is a bit dangerous because the primary in such a system might be abnormal: a 

further method of estimating the distance is from the reddening of the light 

from the primary by interstellar dust [ll] . The best estimate of distance is 

6000 light years and the corresponding value of £ is 10^^ erg s""^ (about 

10 ^ £ ) . 

© 

If we take T 10*^ °K(corresponding to a thermal energy in the gas orbiting 
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the secondary of '^IkeV) then 
and if M = GMQthen 


r « 100 km 
m ^ 10^^ gm s 


« 10 -® 

each year. This is a very reasonable rate of transfer: it must be remem¬ 
bered that the cooler outer regions of the disc of gas must not be too dense 
or the X-rays could not get out. 


The X-ray source Hercules X—1 provides a beautiful illustration of this mecha¬ 
nism at v 7 ork [o]. This binary system has a period of 1.7 days and the X-rays 
are pulsed with a period of just over 1.2 sec; in the range of (radio) pulsars. 
This pulsation presumably reflects the rotation of a neutron star, and has 
allowed the extraction of the velocity of the secondary through the periodic 
Doppler shift exhibited by the pulsing: it is 169 km s [12]. Optical 
Doppler studies of the primary (HZ Herculis) yield a primary velocity of 
«« 80 km s"^ , tl3] . 


From these numbers we can work out at once the masses of both components. 
Assuming small eccentricity, as indicated by the details of the Doppler shifts, 
we have 

M^v^ GM^M2 M^v 

where r , r are the distances from the centre of mass of the system. We 
1 ’ 2 

also have 


GM^M^ 




(10.4,2) 



(10.4.3) 


where t is the period. Then 

Ml = 3.6 X 10^® gm (1,8 M^) 

M 2 = 1,8 X 10®® gm (0.9 M^) . (10,4,4) 

The mass of the primary thus extracted is in agreement with the appearance of 
the primary HZ Herculis and the mass of the secondary is indeed right in the 
middle of the range of masses expected for neutron stars. 

The distance of Hercules X-1 is light years and the X-ray luminosity is 

10^^ erg s“^. This eclipsing system has a further amusing feature. The pri¬ 
mary is brighter and bluer on the side of the X-ray source, becoming dimmer and 
redder when the primary is eclipsing the secondary, [l4]. The atmosphere is 
presumably heated by the impact of X-rays from Hercules X-1 . 


The X-ray source Centaurus X-3 has similar features [q]. The period is 2.1 
days and the X-rays are pulsed with a period of 4.84 seconds: presumably the 
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accreting secondary is another neutron star. 

We cannot of course conclude that Cygnus X-1 contains a black hole. It is not 
impossible that the accreting object is less massive than 5M^. For example, 
perhaps it is a neutron star in orbit about a star of mass 5M^, which in turn 
is orbiting the primary, although this seems unlikely. However, even if the 
accreting mass is all the activity the X-ray telescopes are looking at 

is occurring a hundred kilometres out and the hypothetical critical radius is 
at '^15km. The critical radius ^diich is the characteristic distinguishing 
the fields of black holes from other sources of strong gravitational fields is 
thus, at least for the moment, beyond observation. 

10.5 The universe 

On a large scale the imiverse is of roughly constant density. If it is 
both big enough and dense enough the gravitational field will make escape of 
light to infinity impossible and the universe would then be closed. For a 
proper discussion of cosmology it is necessary to use the solutions of 
Einstein’s field equations in the presence of a uniform density of matter [l5]: 
here we merely note that the condition the universe be closed is very approxi- 
mately 

1. (10.5.1) 

The radius of the universe should be taken as «10^® cm , corresponding to ex¬ 
pansion at approximately the velocity of light for about 10^^ years. In order 
to satisfy Eq. (10.5.1) we need a density lO"^® gm cm"®. However the best 

estimates of the density currently available yield p » 10“^^ gm cm"®, and it 
seems that the left-hand side of Eq. (10.5.1) is at least an order of magni¬ 
tude less than unity, implying an open universe rather than a universal black 
hole, [16]. 
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